NUCLEIC ACID MUTATION DETECTION BY ANALYSIS OF SPUTUM

This work was supported by a grant from the National Cancer Institute (NCI
Grant #1 P50 CA58184-01). The United States Government may retain certain
rights in the invention.

BACKGROUND OF THE INVENTION
1. Field of the Invention

This invention relates to a method of detecting a target nucleic acid in sputum 
and reagents useful therein. 

2. Description of Related Art

An increasing body of evidence implicates somatic mutations as causally
important in the induction of human cancers. These somatic mutations may
accumulate in the genomes of previously normal cells, some of which may then
demonstrate the phenotypes associated with malignant growth. Such oncogenic
mutations may include a number of different types of alterations in DNA structure,
including deletions, translocations and single nucleotide alterations. The latter,
also known as point mutations, may frequently be involved in carcinogenesis, in
that a variety of mutagenic chemicals induce such mutations. In addition, such
mutations may occur spontaneously as a result of mistakes in DNA replication.

Advances in recombinant DNA technology have led to the discovery of normal
cellular genes (proto-oncogenes and tumor suppressor genes) which control
growth, development, and differentiation. Under certain circumstances,
regulation of these genes is altered, causing normal cells to assume neoplastic
growth characteristics. There are over 40 known proto-oncogenes and
suppressor genes to date, which fall into various categories depending on their
functional characteristics. These include (1) growth factors and growth factor
receptors, (2) messengers of intracellular signal transduction pathways, for
example, between the cytoplasm and the nucleus, and (3) regulatory proteins
influencing gene expression and DNA replication.

Point mutations have been directly implicated in the causation of many human
tumors. Some tumors carry oncogenes of the ras gene family, which differ from
their normal cellular counterpart proto-oncogenes by the presence of a point
mutation at one of a limited number of sites in these genes. Similarly, point
mutations in critical regions of tumor suppressor genes, such as p53, are often
detected in tumor cells. These mutations represent qualitative changes in the
tumor cell genome which distinguish these cells from normal cells and provide
a basis for diagnosis of the genetic origin of a tumor under study. Identification
of the mutations that have created active oncogenes may provide important
diagnostic and prognostic clues for tumor development. For example, a number
of mutations have been found to alter the 12th codon of the ras oncogenes,
causing replacement of a normally present glycine by any of a number of
alternative amino acid residues. Such amino acid substitutions create a potent
transforming allele. Thus, the presence of a particular nucleotide substitution
may be a strong determinant of the behavior of the tumor cell (e.g., its rate of
growth, invasiveness, etc.). As a result, DNA probes for oncogene mutations
have promise as diagnostic reagents in clinical oncology.

Among the various types of neoplasms, a number of those which are found in the
lungs are associated with oncogenic mutations. Lung cancer is the leading
cause of cancer related deaths in Western countries. The prognosis for patients
with lung cancer is primarily dependent on the stage of the tumor at the time of
clinical diagnosis. Currently, only 25 to 40 percent of all lung tumors are
considered resectable at the time of initial assessment. Patients diagnosed early
with stage I tumors have a 40-70% survival following surgical resection. An
attempt at lung cancer screening through the use of tri-annual sputum cytology
and annual chest x-ray has proven inadequate for the early detection of lung
cancer. Alternatively, the finding that tumors progress through a series of
well-defined genetic changes, including point mutations in oncogenes, has stimulated
efforts to develop additional, non-invasive tests that could more reliably detect
neoplasms of the lung, but such attempts have failed. The present invention
addresses this need.

SUMMARY OF THE INVENTION

The present invention arose from the unexpected finding that nucleic acid having
a mutant nucleotide sequence associated with lung neoplasia is present in
detectable levels in sputum specimens from patients with lung neoplasia.

As a consequence of this discovery, the present invention represents a
significant advance over such techniques as tissue biopsy by providing a
non-invasive, rapid, and accurate method for detecting mutant nucleotide sequences
associated with lung neoplasia. The DNA amplification based approach of the
invention can identify one cell carrying a mutant gene among a large excess
(greater than 10,000) of normal cells. Based on this finding, it is now possible to
detect various other target nucleic acids associated with other disease states.

The present invention provides a method which can be used as an adjunct to
cytopathology, to screen high-risk populations and to monitor high-risk patients
undergoing chemoprevention or chemotherapy.



DESCRIPTION OF THE DRAWINGS 



FIGURE 1 shows gene mutations in sputum. Clones with a PCR insert from the
p53 gene hybridized to an oligomer probe specific for the codon 273 Arg->His
mutation in the patient's (L4) tumor (T). Fewer clones hybridized to the same
probe in the patient's sputum (S), and no clones hybridized to the probe in a
control sputum from a patient (L15) without a p53 gene mutation in the patient's
primary lung cancer (C).

FIGURE 2 shows sequencing gels of clones which contained the p53 normal and
mutant sequences. The wild type sequence is seen in the normal (N) control and the
273 CGT->CAT (Arg->His) mutant sequence (arrow) in the tumor (T) with a
residual wild type band. The same mutant band is seen in sputum (S) as that
found in the primary tumor.



DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a method of detecting a nucleic acid having a
mutant nucleotide sequence present in sputum, wherein the presence of the
altered nucleic acid sequence is associated with neoplasia of the lung.

In its broadest sense, the present invention allows the detection of any target
nucleic acid sequence of diagnostic or therapeutic relevance, where the target
nucleic acid sequence is present in sputum. Thus, the target nucleotide
sequence may be, for example, a mutant nucleotide, a restriction fragment length
polymorphism (RFLP), a nucleotide deletion, a nucleotide substitution, or any
other mammalian nucleic acid sequence of interest.

In one embodiment, the method of the invention is applicable for detection of
mutant nucleotide sequences associated with benign as well as malignant
neoplasias. In a preferred embodiment, neoplasia of the lung is detected,
although the method can be used to detect any neoplastic mutant nucleotide
sequence, regardless of origin, as long as the sequence is detectably present in
sputum. For example, head and neck cancers shed cancer cells into sputum and
can be detected.

Numerous nucleic acids having mutant nucleotide sequences that produce an
abnormal gene product are known to be associated with various neoplasias.
Among the most common mutant nucleotide sequences are oncogenes and
tumor suppressor genes, such as the K-ras mutant oncogene, mutated in colon
cancer (MCC), deleted in colon cancer (DCC), adenomatous polyposis coli
(APC), familial adenomatous polyposis coli (FAP) and p53. Of special
significance in the present invention is the detection of the K-ras mutant
oncogene and the p53 tumor suppressor gene (Vogelstein, Nature, 348:681,
1990).

In order to analyze sputum specimens according to the method of the invention,
it is preferable to enrich for epithelial cells present in the specimen. This may be
accomplished by mixing the sample with an epithelial cell specific monoclonal
antibody, such as EBA-1 or Ber-Ep4 (Can. Res. 53:3455, 1993; available from
Dakopatts, Glostrup, Denmark). Other epithelial cell specific antibodies will be
known to those of skill in the art.



Amino acids referred to herein may be identified according to the following
three-letter or one-letter abbreviations:



                       Three-Letter     One-Letter
Amino Acid             Abbreviation     Abbreviation

L-Alanine              Ala              A
L-Arginine             Arg              R
L-Asparagine           Asn              N
L-Aspartic Acid        Asp              D
L-Cysteine             Cys              C
L-Glutamine            Gln              Q
L-Glutamic Acid        Glu              E
L-Glycine              Gly              G
L-Histidine            His              H
L-Isoleucine           Ile              I
L-Leucine              Leu              L
L-Lysine               Lys              K
L-Methionine           Met              M
L-Phenylalanine        Phe              F
L-Proline              Pro              P
L-Serine               Ser              S
L-Threonine            Thr              T
L-Tryptophan           Trp              W
L-Tyrosine             Tyr              Y
L-Valine               Val              V

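For readers who handle these abbreviations programmatically, the table above can be captured as a simple lookup. The following is a minimal Python sketch; the names THREE_TO_ONE and to_one_letter are illustrative only and are not part of the specification.

```python
# Three-letter to one-letter amino acid abbreviations, transcribed from the
# table above. The dictionary and helper names are hypothetical.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(residues):
    """Convert a sequence of three-letter codes into a one-letter string."""
    return "".join(THREE_TO_ONE[residue] for residue in residues)


if __name__ == "__main__":
    # The codon 273 substitution described for FIGURE 1 (Arg replaced by His).
    print(to_one_letter(["Arg", "His"]))
```

An unknown code raises KeyError, which is usually preferable to silently passing a misread abbreviation through.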
When it is desired to amplify the mutant nucleotide sequence before detection,
this can be accomplished using oligonucleotide(s) which are primers for
amplification. These unique oligonucleotide primers are based upon
identification of the flanking regions contiguous with the mutant nucleotide
sequence and are capable of substantially hybridizing with the flanking regions
so that amplification can proceed. For example, in the case of K-ras, these
oligonucleotide primers comprise sequences such as nucleotide sequence
5'-AGGAATTCATGACTGAATATAAACTTGT-3' (SEQ. ID NO. 1) and/or
5'-ATCGAATTCTATGCATATTAAAACAAGATT-3' (SEQ. ID NO. 2) and
sequences complementary thereto. In the case of p53, the oligonucleotide
primers comprise sequences which are capable of hybridizing with the flanking
nucleotide sequence, wherein the primers are
5'-GTAGGAATTCACTTGTGCCCTGACTT-3' (SEQ. ID NO. 3) and
5'-CATCGAATTCCACTGACAACCACCCTT-3' (SEQ. ID NO. 4) for exons 5-6, and
5'-GTAGGAATTCCAAGGCGCACTGGCCTC-3' (SEQ. ID NO. 5) and
5'-ACTGAATTCTTCGTCTCCTCCACCGC-3' (SEQ. ID NO. 6) for exons 7-8, and
sequences complementary thereto.
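Each of the six primers listed above carries the hexamer GAATTC near its 5' end, which is the recognition sequence of the restriction enzyme EcoRI. The specification does not name the enzyme at this point, so treating the hexamer as an EcoRI cloning site is an inference. The following minimal Python sketch, with illustrative names and with the primer sequences transcribed under the assumption that every character is one of A, C, G or T, locates the site in each primer.

```python
# Recognition sequence of EcoRI. Reading the shared hexamer as an EcoRI
# cloning site is an assumption; the text only speaks of 5' restriction sites.
ECORI_SITE = "GAATTC"

# Primer sequences (5'->3') transcribed from the passage above.
PRIMERS = {
    "SEQ ID NO. 1": "AGGAATTCATGACTGAATATAAACTTGT",
    "SEQ ID NO. 2": "ATCGAATTCTATGCATATTAAAACAAGATT",
    "SEQ ID NO. 3": "GTAGGAATTCACTTGTGCCCTGACTT",
    "SEQ ID NO. 4": "CATCGAATTCCACTGACAACCACCCTT",
    "SEQ ID NO. 5": "GTAGGAATTCCAAGGCGCACTGGCCTC",
    "SEQ ID NO. 6": "ACTGAATTCTTCGTCTCCTCCACCGC",
}


def site_offset(primer, site=ECORI_SITE):
    """Return the 0-based offset of the site within the primer, or -1 if absent."""
    return primer.find(site)


def template_specific_part(primer, site=ECORI_SITE):
    """Return the bases 3' of the site, i.e. the part expected to anneal."""
    offset = site_offset(primer, site)
    if offset < 0:
        return primer
    return primer[offset + len(site):]


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, site_offset(seq), template_specific_part(seq))
```

Splitting the primer at the site separates the 5' cloning tail from the 3' portion that would be expected to hybridize with the flanking region.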

The primers which can be used according to the method of the invention
embrace oligonucleotides of sufficient length and appropriate sequence so as to
provide specific initiation of polymerization of a significant number of nucleic acid
molecules containing the target nucleic acid. In this manner, it is possible to
selectively amplify the specific target nucleic acid sequence containing the
nucleic acid of interest. Specifically, the term "primer" as used herein refers to
a sequence comprising two or more deoxyribonucleotides or ribonucleotides,
preferably at least eight, which sequence is capable of initiating synthesis of a
primer extension product, which is substantially complementary to a target
nucleic acid strand. The oligonucleotide primer typically contains 15-22 or more
nucleotides, although it may contain fewer nucleotides.

Experimental conditions conducive to synthesis include the presence of
nucleoside triphosphates and an agent for polymerization, such as DNA
polymerase, and a suitable temperature and pH. The primer is preferably single
stranded for maximum efficiency in amplification but may be double-stranded.
If double-stranded, the primer is first treated to separate its strands before being
used to prepare extension products. Preferably, the primer is an
oligodeoxyribonucleotide. The primer must be sufficiently long to prime the synthesis of
extension products in the presence of the inducing agent for polymerization. The
exact length of primer will depend on many factors, including temperature, buffer,
and nucleotide composition.

Primers used according to the method of the invention are designed to be
"substantially" complementary to each strand of target nucleotide sequence to
be amplified. Substantially complementary means that the primers must be
sufficiently complementary to hybridize with their respective strands under
conditions which allow the agent for polymerization to function. In other words,
the primers should have sufficient complementarity with the flanking sequences
to hybridize therewith and permit amplification of the nucleotide sequence.
Preferably, the 3' terminus of the primer that is extended has perfectly base
paired complementarity with the complementary flanking strand.
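The distinction drawn above, between overall "substantial" complementarity and perfect pairing at the 3' terminus, can be illustrated with a small check. This is a minimal Python sketch under stated assumptions: the function names, the choice of the last three bases as the 3' window (parameter k), and the example site are all hypothetical and are not taken from the specification.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def mismatches(primer, template_site):
    """Count positions where the primer fails to pair with the template site.

    template_site is the 5'->3' sequence of the strand the primer anneals to,
    covering the same number of bases as the primer.
    """
    ideal = reverse_complement(template_site)
    return sum(a != b for a, b in zip(primer, ideal))


def three_prime_matched(primer, template_site, k=3):
    """True if the last k bases at the primer's 3' end pair perfectly.

    k=3 is an illustrative window, not a value given in the text.
    """
    ideal = reverse_complement(template_site)
    return len(primer) == len(template_site) and primer[-k:] == ideal[-k:]


if __name__ == "__main__":
    site = reverse_complement("ACTTGTGCCCTGACTT")  # hypothetical annealing site
    for primer in ("ACTTGTGCCCTGACTT", "GCTTGTGCCCTGACTT", "ACTTGTGCCCTGACTA"):
        print(primer, mismatches(primer, site), three_prime_matched(primer, site))
```

A primer with a single mismatch near its 5' end remains substantially complementary and keeps a matched 3' terminus, whereas the same single mismatch placed at the 3' terminal base fails the check.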

Oligonucleotide primers used according to the invention are employed in any
amplification process that produces increased quantities of target nucleic acid.
Typically, one primer is complementary to the negative (-) strand of the mutant
nucleotide sequence and the other is complementary to the positive (+) strand.
Annealing the primers to denatured nucleic acid followed by extension with an
enzyme, such as the large fragment of DNA Polymerase I (Klenow) or Taq DNA
polymerase, and nucleotides results in newly synthesized + and - strands
containing the target nucleic acid. Because these newly synthesized nucleic
acids are also templates, repeated cycles of denaturing, primer annealing, and
extension result in exponential production of the region (i.e., the target mutant
nucleotide sequence) defined by the primers. The product of the amplification
reaction is a discrete nucleic acid duplex with termini corresponding to the ends
of the specific primers employed. Those of skill in the art will know of other
amplification methodologies which can also be utilized to increase the copy
number of target nucleic acid.
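The exponential character of the cycling described above can be made concrete with a short calculation. The following minimal Python sketch assumes an idealized model in which each cycle multiplies the copy number by (1 + efficiency); the efficiency parameter and the function names are illustrative assumptions and do not appear in the specification.

```python
import math


def theoretical_copies(initial_copies, cycles, efficiency=1.0):
    """Idealized copy number after a number of amplification cycles.

    efficiency=1.0 means perfect doubling each cycle; real reactions are
    less efficient, so this is an upper bound rather than a prediction.
    """
    return initial_copies * (1.0 + efficiency) ** cycles


def cycles_needed(initial_copies, target_copies, efficiency=1.0):
    """Smallest whole number of cycles reaching target_copies in this model."""
    ratio = target_copies / initial_copies
    return math.ceil(math.log(ratio) / math.log(1.0 + efficiency))


if __name__ == "__main__":
    print(theoretical_copies(1, 30))
    print(cycles_needed(1, 1_000_000))
    print(cycles_needed(1, 1_000_000, efficiency=0.8))
```

Under perfect doubling, about twenty cycles suffice for a million-fold increase; lowering the assumed efficiency raises the number of cycles required.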

The oligonucleotide primers for use in the invention may be prepared using any
suitable method, such as conventional phosphotriester and phosphodiester
methods or automated embodiments thereof. In one such automated
embodiment, diethylphosphoramidites are used as starting materials and may be
synthesized as described by Beaucage, et al. (Tetrahedron Letters,
22:1859-1862, 1981). One method for synthesizing oligonucleotides on a modified solid
support is described in U.S. Patent No. 4,458,066. One method of amplification
which can be used according to this invention is the polymerase chain reaction
(PCR) described in U.S. Patent Nos. 4,683,202 and 4,683,195.

Any sputum specimen nucleic acid, in purified or nonpurified form, can be utilized
as the starting nucleic acid or acids, provided it contains, or is suspected of
containing, the specific nucleic acid sequence containing the target nucleic acid.
Thus, the process may employ, for example, DNA or RNA, including messenger
RNA, wherein DNA or RNA may be single stranded or double stranded. If RNA
is to be used as a template, enzymes and/or conditions optimal for reverse
transcribing the template to DNA would be utilized. In addition, a DNA-RNA
hybrid which contains one strand of each may be utilized. A mixture of nucleic
acids may also be employed, or the nucleic acids produced in a previous
amplification reaction herein, using the same or different primers, may be so
utilized. The mutant nucleotide sequence to be amplified may be a fraction of
a larger molecule or can be present initially as a discrete molecule, so that the
specific sequence constitutes the entire nucleic acid. It is not necessary that the
sequence to be amplified be present initially in a pure form; it may be a minor
fraction of a complex mixture, such as contained in whole human DNA.

Where the target mutant nucleotide sequence of the sample contains two
strands, it is necessary to separate the strands of the nucleic acid before it can
be used as the template. Strand separation can be effected either as a separate
step or simultaneously with the synthesis of the primer extension products. This
strand separation can be accomplished using various suitable denaturing
conditions, including physical, chemical, or enzymatic means; the word
"denaturing" includes all such means. One physical method of separating nucleic
acid strands involves heating the nucleic acid until it is denatured. Typical heat
denaturation may involve temperatures ranging from about 80° to 105°C for
times ranging from about 1 to 10 minutes. Strand separation may also be
induced by an enzyme from the class of enzymes known as helicases or by the
enzyme RecA, which has helicase activity and, in the presence of riboATP, is
known to denature DNA. The reaction conditions suitable for strand separation
of nucleic acids with helicases are described by Kuhn Hoffmann-Berling
(CSH-Quantitative Biology, 43:63, 1978), and techniques for using RecA are reviewed
in C. Radding (Ann. Rev. Genetics, 16:405-437, 1982).

If the nucleic acid containing the target nucleic acid to be amplified is single
stranded, its complement is synthesized by adding one or two oligonucleotide
primers. If a single primer is utilized, a primer extension product is synthesized
in the presence of primer, an agent for polymerization, and the four nucleoside
triphosphates described below. The product will be complementary to the
single-stranded nucleic acid and will hybridize with a single-stranded nucleic acid
to form a duplex of unequal length strands that may then be separated into single
strands to produce two single separated complementary strands. Alternatively,
two primers may be added to the single-stranded nucleic acid and the reaction
carried out as described.

When complementary strands of nucleic acid or acids are separated, regardless
of whether the nucleic acid was originally double or single stranded, the
separated strands are ready to be used as a template for the synthesis of
additional nucleic acid strands. This synthesis is performed under conditions
allowing hybridization of primers to templates to occur. Generally synthesis
occurs in a buffered aqueous solution, preferably at a pH of 7-8, most preferably
about 8. Preferably, a molar excess (for genomic nucleic acid, usually about
10^8:1 primer:template) of the two oligonucleotide primers is added to the buffer
containing the separated template strands. It is understood, however, that the
amount of complementary strand may not be known if the process of the
invention is used for diagnostic applications, so that the amount of primer relative
to the amount of complementary strand cannot be determined with certainty. As
a practical matter, however, the amount of primer added will generally be in
molar excess over the amount of complementary strand (template) when the
sequence to be amplified is contained in a mixture of complicated long-chain
nucleic acid strands. A large molar excess is preferred to improve the efficiency
of the process.

In some amplification embodiments, the substrates, for example, the
deoxyribonucleotide triphosphates dATP, dCTP, dGTP, and dTTP, are added to
the synthesis mixture, either separately or together with the primers, in adequate
amounts and the resulting solution is heated to about 90°-100°C from about 1 to
10 minutes, preferably from 1 to 4 minutes. After this heating period, the solution
is allowed to cool to room temperature, which is preferable for the primer
hybridization. To the cooled mixture is added an appropriate agent for effecting
the primer extension reaction (called herein "agent for polymerization"), and the
reaction is allowed to occur under conditions known in the art. The agent for
polymerization may also be added together with the other reagents if it is heat
stable. This synthesis (or amplification) reaction may occur at room temperature
up to a temperature above which the agent for polymerization no longer
functions. Thus, for example, if DNA polymerase is used as the agent, the
temperature is generally no greater than about 40°C. Most conveniently the
reaction occurs at room temperature.

The agent for polymerization may be any compound or system which will function
to accomplish the synthesis of primer extension products, including enzymes.
Suitable enzymes for this purpose include, for example, E. coli DNA polymerase
I, Taq polymerase, Klenow fragment of E. coli DNA polymerase I, T4 DNA
polymerase, other available DNA polymerases, polymerase muteins, reverse
transcriptase, ligase, and other enzymes, including heat-stable enzymes (i.e.,
those enzymes which perform primer extension after being subjected to
temperatures sufficiently elevated to cause denaturation). Suitable enzymes will
facilitate combination of the nucleotides in the proper manner to form the primer
extension products which are complementary to each mutant nucleotide strand.
Generally, the synthesis will be initiated at the 3' end of each primer and proceed
in the 5' direction along the template strand, until synthesis terminates, producing
molecules of different lengths. There may be agents for polymerization,
however, which initiate synthesis at the 5' end and proceed in the other direction,
using the same process as described above. In any event, the method of the
invention is not to be limited to the embodiments of amplification which are
described herein.

The newly synthesized mutant nucleotide strand and its complementary nucleic
acid strand will form a double-stranded molecule under hybridizing conditions
described above and this hybrid is used in subsequent steps of the process.

In the next step, the newly synthesized double-stranded molecule is subjected
to denaturing conditions using any of the procedures described above to provide
single-stranded molecules.

The above process is repeated on the single-stranded molecules. Additional
agent for polymerization, nucleosides, and primers may be added, if necessary,
for the reaction to proceed under the conditions prescribed above. Again, the
synthesis will be initiated at one end of each of the oligonucleotide primers and
will proceed along the single strands of the template to produce additional nucleic
acid. After this step, half of the extension product will consist of the specific
nucleic acid sequence bounded by the two primers.

The steps of denaturing and extension product synthesis can be repeated as
often as needed to amplify the target mutant nucleotide sequence to the extent
necessary for detection. The amount of the mutant nucleotide sequence
produced will accumulate in an exponential fashion.

The amplified product may be detected by analyzing it by Southern blots without
using radioactive probes. In such a process, for example, a small sample of DNA
containing a very low level of mutant nucleotide sequence is amplified, and
analyzed via a Southern blotting technique. The use of non-radioactive probes
or labels is facilitated by the high level of the amplified signal.

Nucleic acids having a mutation detected in the method of the invention can be
further evaluated, detected, cloned, sequenced, and the like, either in solution
or after binding to a solid support, by any method usually applied to the detection
of a specific DNA sequence such as PCR, oligomer restriction (Saiki, et al.,
Bio/Technology, 3:1008-1012, 1985), allele-specific oligonucleotide (ASO) probe
analysis (Conner, et al., Proc. Natl. Acad. Sci. USA, 80:278, 1983),
oligonucleotide ligation assays (OLAs) (Landegren, et al., Science, 241:1077,
1988), and the like. Molecular techniques for DNA analysis have been reviewed
(Landegren, et al., Science, 242:229-237, 1988). Thus, in a preferred
embodiment where the mutant nucleotide sequence to be detected is K-ras, a
hybridization probe is utilized which is capable of hybridizing with mutant
nucleotide sequences comprising 5'-TTGCCTACGCCAACAGCTCC-3' (Val12)
(SEQ. ID NO. 7), 5'-TTGCCTACGCCATCAGCTCC-3' (Asp12) (SEQ. ID NO. 8),
5'-TTGCCTACGCCACTAGCTCC-3' (Ser12) (SEQ. ID NO. 9), or
5'-TTGCCTACGCCACAAGCTCC-3' (Cys12) (SEQ. ID NO. 10) and sequences
complementary thereto. Where the mutant nucleotide sequence to be detected
is p53, a hybridization probe is utilized which is capable of hybridizing with
mutant nucleotide sequences comprising 5'-CACAAACATGCACCTCAA-3'
(His273) (SEQ. ID NO. 11) or 5'-TGCGCCGGCCTCTCCCA-3' (Gly281) (SEQ. ID
NO. 12) and sequences complementary thereto. The wild type K-ras and wild
type p53 are detected by hybridizing with nucleotide probes which hybridize with
nucleotide sequences comprising 5'-TTGCCTACGCCACCAGCTCC-3' (SEQ. ID
NO. 13) and 5'-CCGGTTCATGGCGCCCAT-3' (SEQ. ID NO. 14), respectively.
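The relationship between the K-ras probes listed above and the amino acid each one reports can be checked by comparison with the wild type probe. The following minimal Python sketch rests on two inferences that are not stated in the text: that the probes are written as the antisense strand, and that the codon of interest occupies probe positions 11 to 13 (0-based). All names are illustrative, and the codon table holds only the five codons needed here.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Only the sense codons needed for this illustration (standard genetic code).
CODON_TO_RESIDUE = {"GGT": "Gly", "GTT": "Val", "GAT": "Asp", "AGT": "Ser", "TGT": "Cys"}

WILD_TYPE_PROBE = "TTGCCTACGCCACCAGCTCC"  # SEQ. ID NO. 13
MUTANT_PROBES = {
    "Val": "TTGCCTACGCCAACAGCTCC",  # SEQ. ID NO. 7
    "Asp": "TTGCCTACGCCATCAGCTCC",  # SEQ. ID NO. 8
    "Ser": "TTGCCTACGCCACTAGCTCC",  # SEQ. ID NO. 9
    "Cys": "TTGCCTACGCCACAAGCTCC",  # SEQ. ID NO. 10
}

# Assumed location of the antisense codon within each 20-mer (an inference).
CODON_WINDOW = slice(11, 14)


def sense_codon(probe):
    """Reverse-complement the assumed codon window to obtain the sense codon."""
    return probe[CODON_WINDOW].translate(_COMPLEMENT)[::-1]


def residue(probe):
    """Amino acid encoded at the assumed codon position."""
    return CODON_TO_RESIDUE[sense_codon(probe)]


def differing_positions(a, b):
    """0-based positions at which two equal-length probes differ."""
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


if __name__ == "__main__":
    print("wild type", sense_codon(WILD_TYPE_PROBE), residue(WILD_TYPE_PROBE))
    for label, probe in MUTANT_PROBES.items():
        print(label, sense_codon(probe), residue(probe),
              differing_positions(WILD_TYPE_PROBE, probe))
```

Under these assumptions each mutant probe differs from the wild type probe at exactly one base, and the decoded residue agrees with the label given in the text.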

In an embodiment of the invention, purified nucleic acid fragments containing
intervening sequences or oligonucleotide sequences of 10-50 base pairs are
radioactively labelled. The labelled preparations are used to probe nucleic acid
from sputum by the Southern hybridization technique. Nucleotide fragments from
sputum, before or after amplification, are separated into fragments of different
molecular masses by gel electrophoresis and transferred to filters which bind
nucleic acid. After exposure to the labelled probe, which will hybridize to
nucleotide fragments containing target nucleic acid sequences, binding of the
radioactive probe to target nucleic acid fragments is identified by autoradiography
(see Genetic Engineering, 1, ed. Robert Williamson, Academic Press, (1981),
72-81). Alternatively, nucleic acid from sputum can be bound directly to filters, to
which the radioactive probe selectively binds nucleic acids having the specific
sequences of interest, and the degree of binding is quantitated by directly
counting the radioactive emissions. Where the target nucleic acid is not
amplified, detection using an appropriate hybridization probe may be performed
directly on the separated mammalian nucleic acid. In those instances where the
target nucleic acid is amplified, detection with the appropriate hybridization probe
would be performed after amplification.

The probes of the present invention can be used for examining the distribution
of the specific fragments detected, as well as the quantitative (relative) degree
of binding of the probe for determining the occurrence of specific strongly binding
(hybridizing) sequences, thus indicating the likelihood for an individual to be at
low risk or high risk for neoplastic disease, such as a lung carcinoma. Further
cloning allows specific evaluation of the number of mutant nucleotides (i.e.,
mutant cells), allowing a precise estimate of the risk of developing neoplastic disease.

For the most part, the probe will be labelled with an atom or inorganic radical,
most commonly using radionuclides, but also perhaps heavy metals.
Conveniently, a radioactive label may be employed. Radioactive labels include
32P, 3H, 14C, or the like. Any radioactive label may be employed which
provides for an adequate signal and has sufficient half-life. Other labels include
ligands, which can serve as a specific binding pair member for a labelled ligand,
and the like. A wide variety of labels have been employed in immunoassays
which can readily be employed in the present assay. The choice of the label will
be governed by the effect of the label on the rate of hybridization and binding of
the probe to mutant nucleotide sequence. It will be necessary that the label
provide sufficient sensitivity to detect the amount of mutant nucleotide sequence
available for hybridization. Other considerations will be ease of synthesis of the
probe, readily available instrumentation, ability to automate, convenience, and
the like.

The manner in which the label is bound to the probe will vary depending upon the
nature of the label. For a radioactive label, a wide variety of techniques can be
employed. Commonly employed is nick translation with an α-32P-dNTP or
terminal labeling with radioactive 32P employing γ-32P-ATP and T4 polynucleotide
kinase. Alternatively, nucleotides can be synthesized where one or more of the
elements present are replaced with a radioactive isotope, e.g., hydrogen with
tritium. If desired, complementary labelled strands can be used as probes to
enhance the concentration of hybridized label.

Where other radionuclide labels are involved, various linking groups can be
employed. A terminal hydroxyl can be esterified with inorganic acids (e.g.,
phosphate) or 14C organic acids, or else esterified to provide linking groups to
the label. Alternatively, intermediate bases may be substituted with activatable
linking groups which can then be linked to a label.

Enzymes of interest as reporter groups will primarily be hydrolases, particularly
esterases and glycosidases, or oxidoreductases, particularly peroxidases.
Fluorescent compounds include fluorescein and its derivatives, rhodamine and
its derivatives, dansyl, umbelliferone, and so forth. Chemiluminescers include
luciferin and 2,3-dihydrophthalazinediones (e.g., luminol).

The probe can be employed for hybridizing to a nucleotide sequence affixed to
a water insoluble porous support. Depending upon the source of the nucleic
acid, the manner in which the nucleic acid is affixed to the support may vary.
Those of ordinary skill in the art know, or can easily ascertain, different supports
which can be used in the method of the invention.

Any mammalian cells present in sputum are treated to liberate their nucleic acid.
The target sequences containing mutant nucleotides are amplified by PCR or
other aforementioned techniques. The amplified nucleic acid from a sputum
specimen is spotted or spread onto a filter to provide a plurality of individual
portions. The filter is an inert porous solid support (e.g., nitrocellulose). The
lysing and denaturation of nucleic acid, as well as the subsequent washings, can
be achieved with an appropriate solution for a sufficient time to lyse the cells and
denature the nucleic acid. For lysing, chemical lysing will conveniently be
employed, as described previously for the sputum lysis buffer. Other
denaturation agents include elevated temperatures, organic reagents (e.g.,
alcohols, amides, amines, ureas, phenols and sulfoxides) or certain inorganic
ions (e.g., thiocyanate and perchlorate).

In a preferred embodiment, the amplified nucleic acid containing the mutant
nucleotide is cloned into a vector (e.g., plasmid, cosmid, bacteriophage)
utilizing the 5' restriction sites contained within the amplification primers. Each
clone contains one copy of the amplified target sequence. The clone is
transferred to filters as described above, followed by denaturation. Hybridization
with an oligonucleotide specific for the mutant nucleotide allows the detection of
one mutant nucleotide among 10,000 normal nucleotides which differ at a single
base pair.
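Because each clone carries a single copy of the amplified target, the number of clones screened bounds how rare a mutant can be and still be observed. The following minimal Python sketch treats each clone as an independent draw with a fixed mutant fraction; this sampling model, the 95% confidence default, and the function names are assumptions introduced for illustration and are not part of the specification.

```python
import math


def prob_at_least_one_mutant(mutant_fraction, clones_screened):
    """Chance that at least one screened clone carries the mutant sequence.

    Assumes every clone is an independent draw with the same mutant fraction.
    """
    return 1.0 - (1.0 - mutant_fraction) ** clones_screened


def clones_for_confidence(mutant_fraction, confidence=0.95):
    """Smallest number of clones giving the requested detection probability."""
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - mutant_fraction))


if __name__ == "__main__":
    fraction = 1.0 / 10_000  # one mutant among 10,000, as in the passage above
    print(prob_at_least_one_mutant(fraction, 10_000))
    print(clones_for_confidence(fraction))
```

Under this model, screening exactly as many clones as the reciprocal of the mutant fraction gives a detection probability of roughly 1 - 1/e, so a higher confidence calls for screening several times that number.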

After denaturation, the filter is washed in an aqueous buffered solution, such as
Tris, generally at a pH of about 6 to 8, usually 7. One or more washings may be
involved, conveniently using the same procedure as employed for the lysing and
denaturation. After the lysing, denaturing, and washes have been accomplished,
the nucleic acid spotted filter is dried at an elevated temperature, generally from
about 50°C to 70°C. Under this procedure, the nucleic acid is fixed in position
and can be assayed with the probe when convenient.

Pre-hybridization may be accomplished by incubating the filter at a mildly
elevated temperature for a sufficient time with the hybridization solution without
the probe to thoroughly wet the filter. Various hybridization solutions may be
employed, comprising from about 20% to 60% volume, preferably 30%, of an
inert polar organic solvent. A common hybridization solution employs about 50%
formamide, about 0.5 to 1 M sodium chloride, about 0.05 to 0.1 M sodium citrate,
about 0.05 to 0.2% sodium dodecylsulfate, and minor amounts of EDTA, ficoll
(about 300-500 kD), polyvinylpyrrolidone (about 250-500 kD) and serum
albumin. Also included in the hybridization solution will generally be from about
0.5 to 5 mg/ml of sonicated denatured DNA (e.g., calf thymus or salmon sperm)
and optionally from about 0.5 to 2% wt/vol glycine. Other additives may also be
included, such as dextran sulfate of from about 100 to 1,000 kD and in an
amount of from about 8 to 15 weight percent of the hybridization solution.

The particular hybridization technique is not essential to the invention. Other
hybridization techniques are described by Gall and Pardue, Proc. Natl. Acad. Sci.
63:378, 1969; and John, et al., Nature, 222:582, 1969. As improvements are
made in hybridization techniques they can readily be applied in the method of the
invention.

The amount of labelled probe which is present in the hybridization solution will
vary widely, depending upon the nature of the label, the amount of the labelled
probe which can reasonably bind to the filter, and the stringency of the
hybridization. Generally, substantial excess over stoichiometric concentrations
of the probe will be employed to enhance the rate of binding of the probe to the
fixed target nucleic acid.

Various degrees of stringency of hybridization may be employed. The more
severe the conditions, the greater the complementarity that is required for
hybridization between the probe and the single stranded target nucleic acid
sequence for duplex formation. Severity can be controlled by temperature, probe
concentration, probe length, ionic strength, time, and the like. Conveniently, the
stringency of hybridization is varied by changing the polarity of the reactant
solution by manipulating the concentration of formamide in the range of 20% to
50%. Temperatures employed will normally be in the range of about 20°C to
80°C, usually 30°C to 75°C (see, generally, Current Protocols in Molecular
Biology, Ausubel, ed., Wiley & Sons, 1989).

After the filter has been contacted with a hybridization solution at a moderate
temperature for a period of time sufficient to allow hybridization to occur, the filter
is then introduced into a second solution having analogous concentrations of
sodium chloride, sodium citrate and sodium dodecylsulfate as provided in the
hybridization solution. The time for which the filter is maintained in the second
solution may vary from five minutes to three hours or more. The second solution
determines the stringency, dissolving cross duplexes and short complementary
sequences. After rinsing the filter at room temperature with dilute sodium citrate-
sodium chloride solution, the filter may now be assayed for the presence of
duplexes in accordance with the nature of the label. Where the label is
radioactive, the filter is dried and exposed to X-ray film.

The probe may also be labeled with a fluorescence moiety which can then be
probed with a specific antifluorescence antibody. Conjugated to this antibody is
an enzyme, for example horseradish peroxidase, able to catalyze a
chemiluminescent reaction. Production of light can then be seen on rapid
exposure to film.

The materials for use in the assay of the invention are ideally suited for the
preparation of a kit. Such a kit may comprise a carrier means being
compartmentalized to receive in close confinement one or more container means
such as vials, tubes, and the like, each of the container means comprising one
of the separate elements to be used in the method.

For example, one of the container means may comprise a hybridization probe
which is or can be detectably labelled. A second container may comprise a
sputum/epithelial cell lysis buffer. The kit may also have containers containing
nucleotide(s) for amplification of the target nucleic acid sequence and/or a
container comprising a reporter-means, such as a biotin-binding protein, such as
avidin or streptavidin, bound to a reporter molecule, such as an enzymatic,
fluorescent, or radionuclide label.

The above disclosure generally describes the present invention. A more
complete understanding can be obtained by reference to the following specific
examples which are provided herein for purposes of illustration only and are not
intended to limit the scope of the invention.

The Johns Hopkins Lung Project (JHLP) developed an archive of sputum
specimens during a randomized trial of lung cancer screening (1974-1982).
Fifteen patients from that trial went on to develop adenocarcinoma of the lung.
The primary lung carcinomas from 10 of these 15 patients contained either a ras
or a p53 gene mutation. Using a PCR-based assay, stored sputum samples
obtained prior to clinical diagnosis were examined for the presence of these
oncogene mutations. In eight out of ten patients, the identical mutation identified
in the primary tumor was also detected in at least one sputum sample. The
earliest detection of a clonal population of cancer cells in sputum was in a
sample obtained more than a year prior to clinical diagnosis. These results
provide the basis for a novel approach for detection of lung cancer based on the
evolving molecular genetics of this disease.
EXAMPLE 1

PROTOCOL FOR SPUTUM DETECTION

A. DNA PREPARATION: Two to 3 ml of stored sputum (in normal saline as
fixative) were placed in 50 ml tubes. The tubes were spun at 1000 x g in a
Beckman model TJ-6 centrifuge for 15 minutes. The supernatant was decanted
and 5 ml phosphate buffered saline (PBS) was added. The tubes were spun at
1000 x g in a Beckman model TJ-6 centrifuge for 15 minutes and decanted once
again. One ml of 5 ng/ul proteinase K/1% SDS solution was added and the tubes
were vortexed well. The tubes were then incubated in a water bath at 60°C for
4-6 hours. The digested sputum was transferred to a 5 ml VACUTAINER
(Becton-Dickinson). One ml PC-9 (phenol chloroform equilibrated with Tris at
pH 9) was added and the tube was vortexed for 1 minute before spinning at 2500
rpm for 15 minutes using a Hermle Z360 K centrifuge. The supernatant was
removed to a new VACUTAINER and the last step above was repeated one
more time.

The supernatant was then transferred to a new plastic tube and 330 ul 10 M
ammonium acetate, 2 ul glycogen, and 3.3 ml 200 proof ethanol were added. The
tube was vortexed well and spun at 6000 rpm for 1 hour using a Hermle Z360 K
centrifuge. The supernatant was decanted and 3.3 ml 70% ethanol was added.
The tube was spun under the above conditions for 2 minutes and the tube was
decanted again. The resulting pellet was dried in a HETOVAC (vacuum drier).
The pellet was then resuspended in 100 ul of distilled water. The sample was
kept at 4°C.

An alternative method for enrichment of malignant epithelial cells and elimination
of non-epithelial cells in the sputum, prior to PCR, is as follows. Sputum cells
were isolated by centrifugation at 1,000 x g. Cells were washed twice in HBSS
with 2% fetal calf serum (FCS) and resuspended in 1 ml. The cell suspension
was cooled on ice and 20 ug/ml of a monoclonal antibody with specificity for
epithelial cells, EBA-1 (other antibodies with specificity for epithelial cells would
be equally effective), was added to the cell suspension and incubated on ice for
one hour. The cells were washed twice in cold (2-8°C) HBSS with 2% FCS. The
cells were resuspended in cold HBSS/2% FCS at a cell concentration of 2-4 x 10"
per ml.

Primary EBA antibodies (bound to epithelial cells) were isolated on magnetic
Dynabeads (Dynal International, Oslo, Norway) as follows. Dynabeads M-450
were coated with a secondary antibody (sheep anti-mouse, by the manufacturer).
The beads were first washed twice for 5 minutes at 2-8°C in phosphate buffered
saline (PBS), pH 7.4, containing 0.1% FCS. The Dynabeads were collected
using a magnet, the supernatant was discarded, and the beads were resuspended
in a volume equivalent to the initial volume.

The Dynabeads coated with the secondary anti-murine antibody were added to the
sputum sample at a ratio of 5 particles per target cell. The concentration
should be about 10' beads per ml of solution. The mixture was incubated for 30
minutes at 4°C on a Rock-N-Roller. Cold (4-8°C) HBSS with 1% FCS was added
in a volume at least 4x the volume of the bead/cell suspension. The Dynabeads
were concentrated using the magnet. The supernatant was removed and the
beads washed thoroughly 3 times using HBSS/1% FCS in a volume equal to that
above (at least 4x the volume of the bead/cell suspension). The Dynabead/cell
suspension was centrifuged at 1,000 x g and resuspended in SDS/Proteinase K
and DNA isolated for PCR.



B. PCR AMPLIFICATION

1. For k-ras, amplify 120 bp DNA fragment:

Sputum DNA                  5 ul
Distilled water            34 ul
PCR buffer (10X)            5 ul
dNTP                        3 ul
Primer Kras 1               1 ul
Primer Kras 1 AS            1 ul
Taq (polymerase, 5 U/ul)    1 ul

Primer Kras 1 = 5'-^GGAATTCATGACTGAATATAAACTTGT-3'
(SEQ. ID NO. 1);

Primer Kras 1 AS = 5'-ATCGAATTCTATGCATATTAAAACAAGATT-3'
(SEQ. ID NO. 2).

The sample was placed in a 500 ul tube to which 2 drops of mineral oil were
added. The sample was amplified in an Omnigene PCR machine, as follows: 95°
C for 30 sec.; 55° C for 1 min., 35 cycles; 70° C for 1 min.; and 70° C for 5 min.,
1 cycle.
2. For p53 (exons 4-5), amplify 500 bp DNA fragment:

Sputum DNA                  5 ul
Distilled water            34 ul
PCR buffer (10X)            5 ul
dNTP                        3 ul
Primer 5S                   1 ul
Primer Int-6AS              1 ul
Taq (polymerase, 5 U/ul)    1 ul

Primer 5S = 5'-GTAGGAATTCACTTGTGCCCTGACTT-3'
(SEQ. ID NO. 3);

Primer Int-6AS = 5'-CATCGAATTCCACTGACAACCACCCTT-3'
(SEQ. ID NO. 4).

The sample was placed in a 500 ul tube to which 2 drops of mineral oil were
added. The sample was amplified in an Omnigene PCR machine, as follows: 95°
C for 30 sec.; 60° C for 1 min., 35 cycles; 70° C for 1 min.; and 70° C for 5 min.,
1 cycle.
3. For p53 (exons 6-7), amplify 750 bp DNA fragment:

Sputum DNA                  5 ul
Distilled water            34 ul
PCR buffer (10X)            5 ul
dNTP                        3 ul
Primer 7S                   1 ul
Primer 8AS                  1 ul
Taq (polymerase, 5 U/ul)    1 ul

Primer 7S = 5'-GTAGGAATTCCAAGGCGCACTGGCCTC-3'
(SEQ. ID NO. 5);

Primer 8AS = 5'-ACTGAATTCTTCGTCTCCTCCACCGC-3'
(SEQ. ID NO. 6).

The sample was placed in a 500 ul tube to which 2 drops of mineral oil were
added. The sample was amplified in an Omnigene PCR machine, as follows: 95°
C for 30 sec.; 60° C for 1 min., 35 cycles; 70° C for 1 min.; and 70° C for 5 min.,
1 cycle.

All amplifications were performed with negative (water devoid of any DNA) and
positive (cell line DNA) controls, such as SW480 (12 val mutation K-ras) or DZ74
(273 cys mutation p53).
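The three reaction set-ups above share the same per-reaction volumes. As a minimal sketch (not part of the protocol itself), the following Python checks that the listed components add up to a 50 ul reaction and scales everything except the template into a master mix. The function names and the 10% pipetting overage are illustrative assumptions, not values taken from the protocol.

```python
# Per-reaction volumes in ul, copied from the reaction tables above.
REACTION_UL = {
    "sputum DNA": 5,
    "distilled water": 34,
    "PCR buffer (10X)": 5,
    "dNTP": 3,
    "sense primer": 1,
    "antisense primer": 1,
    "Taq polymerase (5 U/ul)": 1,
}


def total_volume(mix):
    """Return the total volume of one reaction in ul."""
    return sum(mix.values())


def master_mix(mix, n_reactions, template_key="sputum DNA", overage=0.1):
    """Scale every component except the template for n_reactions.

    The overage (extra fraction to cover pipetting loss) is a hypothetical
    convenience parameter; the protocol does not mention one.
    """
    factor = n_reactions * (1 + overage)
    return {
        name: round(volume * factor, 2)
        for name, volume in mix.items()
        if name != template_key
    }


if __name__ == "__main__":
    print("total per reaction (ul):", total_volume(REACTION_UL))
    for name, volume in master_mix(REACTION_UL, 10).items():
        print(f"{name}: {volume} ul")
```

With 5 ul of 10X buffer in a 50 ul reaction, the buffer ends up at 1X, which is consistent with the volumes listed.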

C. CLONING

To 5 ul of the PCR product above, 5 ul of 2X stop buffer (bromophenol blue in
Ficoll, glycerol and sarcosyl with tris-acetate buffer) was added. The samples
were run on 1% or 2% agarose gels to observe the yield of amplification. The
remaining 45 ul of the PCR product was mixed with 155 ul distilled water. To the
200 ul PCR mixture, 200 ul PC-9 was added and vortexed well. The tube was
spun for 2 min. in a Hermle Z230 M table centrifuge at high speed. The
supernatant was removed to another tube and the pellet was treated with PC-9
one more time.

The supernatant was removed to a new 1.5 ml tube and 66 ul 10 M ammonium
acetate, 2 ul glycogen and 660 ul 200 proof ethanol were added and the tube
was vortexed well. The tube was spun for 20 min. in a Hermle Z230 M table
centrifuge at high speed. The supernatant was decanted and 660 ul 70%
ethanol was added and the solution mixed. The tube was spun for 2 more
minutes and decanted again. The sample was dried in a HETOVAC.
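The precipitation volumes in this step (66 ul of 10 M acetate and 660 ul of ethanol for a 200 ul mixture) correspond to 0.33 and 3.3 sample volumes. The sketch below simply encodes those two ratios. Treating them as general proportions for other sample volumes is an assumption; the protocol only states the absolute volumes.

```python
def precipitation_volumes(sample_ul):
    """Return (acetate_ul, ethanol_ul) for a given sample volume in ul.

    Ratios are derived from the step above: 66 ul acetate and 660 ul
    ethanol per 200 ul of PCR mixture (0.33 and 3.3 volumes).
    """
    if sample_ul <= 0:
        raise ValueError("sample volume must be positive")
    acetate_ul = sample_ul * 33 / 100   # 0.33 volumes of 10 M acetate
    ethanol_ul = sample_ul * 33 / 10    # 3.3 volumes of 200 proof ethanol
    return acetate_ul, ethanol_ul


if __name__ == "__main__":
    print(precipitation_volumes(200))
```

The same ratios would give 330 ul and 3.3 ml for a 1 ml sample, which matches the volumes used in the DNA preparation step, although the sample volume at that point is not stated in the protocol.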

DNA was resuspended by adding 4 to 8 ul of distilled water depending on the
size of the pellet. Two ul of DNA was mixed with 1 ul T4 Lambda Zap II
(Stratagene, La Jolla, CA) and 1 ul standard T4 ligation buffer (5X). The DNA
mixture was incubated at 65°C for 5 min., 37°C for 5 min. and 24°C for 5 min.
in a water bath. One ul of T4 ligase was added and the mixture was incubated
at 15°C for 4 to 6 hours. One ul of the ligated product was mixed with 2.4 ul
packaging extract (Stratagene, La Jolla, CA) (Red) and 3.75 ul packaging extract
(Yellow). The ligation mixture was kept at room temperature for 2 hours. 250 ul
phage dilution buffer (stock phage) was added to the mixture after 2 hours.

About 10 ul to 100 ul stock phage were added to 100 ul XL1-B cells and
incubated at 37°C for 10 min. 4 ml of 55°C top agarose was added and the
mixture was plated on L-Agar gel plates at 37°C overnight.
D. HYBRIDIZATION

A piece of nylon hybridization transfer membrane, Zetaprobe (BioRad, Richmond,
CA), was laid for 1 min. on the surface of the gel containing the lysis plaques.
The membrane was then transferred onto blot paper soaked in 0.5 M NaOH,
plaque side up, for 15 min. The membrane was then rinsed twice in 2X SSC for
5 min. The membrane was then placed on blot paper before crosslinking under
UV light for 30 seconds. The membranes were then placed in plastic bags for
hybridization.

Oligomers (Table 1) were radioactively labeled using γ-ATP by standard
methods (T4 kinase). The labelled probes were added to bags containing plaque
lifts. Hybridization was performed at the temperature which is 10°C below the
melting temperature of the probe for 1 hour in a shaking bath. The membranes
were then removed and washed in 3X SSC/0.1% SDS at room temperature for
5 min. and in 3X SSC/0.1% SDS at the melting temperature of the probe for 30
min. The excess solution was removed from the membrane before wrapping in
Saran Wrap. The membranes were exposed at -80°C for 4 hours or overnight.
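The hybridization temperature above is defined relative to each probe's melting temperature. The text does not say how the melting temperatures were obtained. As an assumption, the sketch below uses the Wallace rule (2°C per A or T, 4°C per G or C), which reproduces the listed values for several probes, for example 56°C for SEQ. ID NO. 15 and 54°C for SEQ. ID NO. 19. The function names are illustrative.

```python
def wallace_tm(probe):
    """Estimate probe melting temperature (deg C) by the Wallace rule.

    Assumption: Tm = 2 * (A + T) + 4 * (G + C). Spaces between triplets
    are ignored so sequences can be pasted as printed in the table.
    """
    seq = "".join(probe.upper().split())
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("probe must contain only A, C, G, T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def hybridization_temp(probe, offset=10):
    """Hybridization temperature: `offset` deg C below the melting temperature."""
    return wallace_tm(probe) - offset


if __name__ == "__main__":
    probe = "CAC CAG CCC CTC CTG G"  # SEQ. ID NO. 15
    print(wallace_tm(probe), hybridization_temp(probe))
```

Under this assumption the stringent wash is done at `wallace_tm(probe)` and the hybridization at `hybridization_temp(probe)`, so probes with different base compositions need separate bags and baths.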
Oligomers Used in Sputum Detection

Oligomers used in detection of p53 gene mutations

SEQ. ID NO.:

15. 5'-CAC CAG CCC CTC CTG G-3' (codon 89 del 1 C) Melting T. 56°C
16. 5'-TCT GGG CTG CTT GCA TTC-3' (codon 113 TTC to TGC) MT. 56°C
17. 5'-GCC AAC TGC CCA AGA CC-3' (codon 138 GCC to CCC) MT. 56°C
18. 5'-GCC CTG TGT AGC TGT GG-3' (codon 144 CAG to TAG) MT. 56°C
19. 5'-GCA GCT GTG AGT TGA TTC-3' (codon 146 TGG to TGA) MT. 54°C
20. 5'-TTC CAC ACA CCC GCC CG-3' (codon 151 CCC to CAC) MT. 58°C
21. 5'-CCC GCC CGT CAC CCG C-3' (codon 154 GGC to GTC) MT. 60°C
22. 5'-GCA CCC GCT TCC GCG C-3' (codon 157 GTC to TTC) MT. 58°C
23. 5'-CCG CGT CCT CGC CAT G-3' (codon 158 CGC to CTC) MT. 56°C
24. 5'-CGT CCG CGT CAT GGC C-3' (codon 159 GCC to GTC) MT. 56°C
25. 5'-CGC CAT GGA CAT CTA CA-3' (codon 161 GCC to GAC) MT. 52°C
26. 5'-CGC CAT GAC CAT CTA CA-3' (codon 161 GCC to ACC) MT. 52°C
27. 5'-[...] TG TGA-3' (codon 171 GAG to TAG) MT. 52°C
28. 5'-[...] GGC GC-3' (codon 173 GTG to TTG) MT. 66°C
29. 5'-[...] ACT GCC CC-3' (codon 175 CGC to CAC) MT. 56°C
30. 5'-[...] GCC CC-3' (codon 175 CGC to CTC) MT. 56°C
31. 5'-AGG CGC TGG CCC CAC C-3' (codon 176 TGC to TGG) MT. 58°C
32. 5'-AGG CGC TAC CCC CAC C-3' (codon 176 TGC to TAC) MT. 56°C
33. 5'-AGG CGC TTC CCC CAC C-3' (codon 176 TGC to TTC) MT. 56°C
34. 5'-[...] CCA CTA TGA GCG CT-3' (codon 179 CAT to TAT) MT. 56°C
35. 5'-CCC CAC CAG GAG CGC T-3' (codon 179 CAT to CAG) MT. 56°C
36. 5'-CCC CAC CGT GAG CGC T-3' (codon 179 CAT to CGT) MT. 56°C
37. 5'-CCC CCA CGA TGA GCG C-3' (codon 179 CAT to GAT) MT. 56°C
38. 5'-TCA GCA TCG TAT CCG AG-3' (codon 194 CTT to CGT) MT. 52°C
39. 5'-ATC CGA GTG TAA GGA AAT T-3' (codon 198 GAA to TAA) MT. 52°C
40. 5'-CGA GTG GAA GAA AAT TTG C-3' (codon 199 GGA to GAA) MT. 54°C
41. 5'-TGT GGA GTG TTT GGA TGA-3' (codon 205 TAT to TGT) MT. 52°C
42. 5'-TGG ATG ACT GAA ACA CTT T-3' (codon 209 AGA to TGA) MT. 52°C
43. 5'-ACA CTT TTT GAC ATA GTG T-3' (codon 213 CGA to TGA) MT. 50°C
44. 5'-ACA CTT TTC CAC ATA GTG T-3' (codon 213 CGA to CCA) MT. 52°C
45. 5'-CAC TTT TCG ACG TAG TGT G-3' (codon 214 CAT to CGT) MT. 56°C
46. 5'-TTT TCG ACA TAT TGT GGT G-3' (codon 215 AGT to ATT) MT. 52°C
47. 5'-ATA GTG TGT TGG TGC CCT-3' (codon 217 GTG to TTG) MT. 54°C
48. 5'-GGT GCC CTG TGA GCC G-3' (codon 220 TAT to TGT) MT. 56°C
49. 5'-CCG CCT GAC GTC TGG TT-3' (codon 224 GAG to GAC) MT. 56°C
50. 5'-TCT GAC TGA ACC ACC ATC-3' (codon 229 TGT to TGA) MT. 54°C
51. 5'-CAT CCA CTG CAA CTA CAT-3' (codon 234 TAC to TGC) MT. 52°C
52. 5'-CAA CTA CAT ATG TAA CAG TT-3' (codon 237 ATG to ATA) MT. 52°C
53. 5'-ACT ACA TGT TTA ACA GTT CC-3' (codon 238 TAT to TTT) MT. 54°C
54. 5'-ACT ACA TGT GAA ACA GTT CC-3' (codon 238 TAT to TGA) MT. 56°C
55. 5'-CAG TTC CTC CAT GGG CG-3' (codon 242 TGC to TCC) MT. 56°C
56. 5'-CAG TTC CTT CAT GGG CG-3' (codon 242 TGC to TTC) MT. 54°C
57. 5'-CAG TTC CTG GAT GGG CG-3' (codon 242 TGC to TGG) MT. 56°C
58. 5'-CCT GCA TGT GCG GCA TG-3' (codon 244 GGC to TGC) MT. 56°C
59. 5'-GCA TGG GCT GCA TGA AC-3' (codon 245 GGC to TGC) MT. 54°C
60. 5'-GCA TGG GCG ACA TGA AC-3' (codon 245 GGC to GAC) MT. 54°C
61. 5'-GGC GGC ATC AAC CGG AG-3' (codon 246 ATG to ATC) MT. 58°C
62. 5'-GGC ATG ATC CGG AGG CC-3' (codon 247 AAC to ATC) MT. 58°C
63. 5'-CAT GAA CCT GAG GCC CAT-3' (codon 248 CGG to CTG) MT. 56°C
64. 5'-GCA TGA ACT GGA GGC CCA-3' (codon 248 CGG to TGG) MT. 58°C
65. 5'-GCA TGA ACC AGA GGC CCA-3' (codon 248 CGG to CAG) MT. 58°C
66. 5'-AAC CGG AGT CCC ATC CTC-3' (codon 249 AGG to AGT) MT. 58°C
67. 5'-AAC CGG AGC CCC ATC CT-3' (codon 249 AGG to AGC) MT. 56°C
68. 5'-AAC CGG ATG CCC ATC CTC-3' (codon 249 AGG to ATG) MT. 58°C
69. 5'-GAA CCG GGG GCC CAT C-3' (codon 249 AGG to GGG) MT. 56°C
70. 5'-TCA CAC TGT AAG ACT CCA-3' (codon 258 GAA to TAA) MT. 52°C
71. 5'-TCA CAC TGA AAG ACT CCA-3' (codon 258 GAA to AAA) MT. 52°C
72. 5'-CAC TGG AAG TCT CCA GGT-3' (codon 259 GAC to GTC) MT. 56°C
73. 5'-TAA TCT ACC GGG ACG GAA-3' (codon 265 CTG to CCG) MT. 54°C
74. 5'-TAA TCT ACC TGG ACG GAA-3' (codon 265 CTG to CCT) MT. 52°C
75. 5'-TCT ACT GGT ACG GAA CAG-3' (codon 266 GGA to GTA) MT. 54°C
76. 5'-ACT GGG ACC GAA CAG CTT-3' (codon 267 CGG to CCG) MT. 54°C
77. 5'-GCT TTG AGC TGC GTG TTT-3' (codon 272 GTG to CTG) MT. 54°C
78. 5'-GCT TTG AGG AGC GTG TTT-3' (codon 272 GTG to GAG) MT. 54°C
79. 5'-TTG AGG TGC TTG TTT GTG-3' (codon 273 CGT to CTT) MT. 52°C
80. 5'-TTG AGG TGT GTG TTT GTG-3' (codon 273 CGT to TGT) MT. 52°C
81. 5'-TTG AGG TGC ATG TTT GTG-3' (codon 273 CGT to CAT) MT. 52°C
82. 5'-TTG AGG TGC CTG TTT GTG-3' (codon 273 CGT to CCT) MT. 54°C
83. 5'-AGG TGC GTG GTT GTG CCT-3' (codon 274 GTT to GGT) MT. 58°C
84. 5'-GCG TGT TTA TGC CTG CCT-3' (codon 275 TGT to TAT) MT. 56°C
85. 5'-GCG TGT TTT TGC CTG CCT-3' (codon 275 TGT to TTT) MT. 56°C
86. 5'-TTG TGC CTT TCC TGG GAG-3' (codon 277 TGT to TTT) MT. 56°C
87. 5'-TGC CTG TCT TGG GAG AGA-3' (codon 278 CCT to CTT) MT. 58°C
88. 5'-TGC CTG TTC TGG GAG AGA-3' (codon 278 CCT to TCT) MT. 56°C
89. 5'-TGC CTG TCG TGG GAG AGA-3' (codon 278 CCT to CGT) MT. 58°C
90. 5'-TCC TGG GAT AGA CCG GCG-3' (codon 280 AGA to ATA) MT. 58°C
91. 5'-TGG GAG ATA CCG GCG CAC-3' (codon 281 GAC to TAC) MT. 58°C
92. 5'-GAG AGA CCC GCG CAC AG-3' (codon 282 CGG to CCG) MT. 58°C
93. 5'-GAG AGA CGG GCG CAC AG-3' (codon 282 CGG to GGG) MT. 58°C
94. 5'-GAG AGA CTG GCG CAC AG-3' (codon 282 CGG to TGG) MT. 56°C
95. 5'-AGA CCG GCC CAC AGA GG-3' (codon 283 CGC to CCC) MT. 58°C
96. 5'-AGA CCG GGG CAC AGA GG-3' (codon 283 CGC to GGC) MT. 58°C
97. 5'-CCG GCG CCC AGA GGA A-3' (codon 284 ACA to CCA) MT. 56°C
98. 5'-GCG CAC AAA GGA AGA GAA-3' (codon 285 GAG to AAG) MT. 54°C
99. 5'-CAC AGA GGG AGA GAA TCT-3' (codon 286 GAA to GGA) MT. 54°C
100. 5'-ATC TCC GCT AGA AAG GGG-3' (codon 291 AAG to TAG) MT. 58°C
101. 5'-GCA AGA AAG GGA GCC TC-3' (codon 293 del 1 G) MT. 54°C
102. 5'-CTC ACC ACT AGC TGC CC-3' (codon 298 GAG to TAG) MT. 56°C
103. 5'-GAT GTT CTG AGA GCT GAA-3' (codon 342 CGA to TGA) MT. 52°C
104. 5'-GAT GTT CCA GAG CTG AAT-3' (codon 342 del 1 G) MT. 52°C
105. 5'-GGC CTT GAA CTC AAG GAT-3' (codon 349 del 1 G) MT. 54°C
106. 5'-CTG GGA GAC ACC GGC G-3' (codon 281 GAG to CAC) MT. 56°C
107. 5'-ACC GGA GGT TCA TCC TC-3' (codon 250 CCC to TTC) MT. 54°C
108. 5'-CAT GTG TAA ACA GTT CCT G-3' (codon 239 ins 1 A) MT. 54°C
109. 5'-AGC ATC TTA ATC CGA GTG-3' (codon 195 ins 1 A) MT. 52°C
110. 5'-GTC TGG CCC TCC TCA GC-3' (codon 191 del 1 C) MT. 58°C
111. 5'-GCT GCC CCC CAC CAT GA-3' (codon 178 ins 1 C) MT. 58°C
112. 5'-CCG CGT CGC GCC ATG-3' (codon 158 del 1 C) MT. 54°C
113. 5'-AGA CCT GCC TGT GCA GC-3' (codon 142 del 1 C) MT. 56°C
114. 5'-CCT GTC CTT GGG AGA GA-3' (codon 279 ins 1 T) MT. 54°C

(Boyle, et al., Can. Res. 53:4477, 1993; Hollstein, et al., Science 253:49,
1991; Somers, et al., Can. Res. 52:5997, 1992; Sakai, et al., Oncogene, 7:972,
1992).
Oligomers used in detection of ras gene mutations

SEQ. ID NO.:

115. 5'-GGAGCTGTTGGCGTAGGCAA-3' (Val12)
116. 5'-GGAGCTGATGGCGTAGGCAA-3' (Asp12)
117. 5'-GGAGCTAGTGGCGTAGGCAA-3' (Ser12)
118. 5'-GGAGCTTGTGGCGTAGGCAA-3' (Cys12)



E. HYBRIDIZATION AND DETECTION 

Alternatively, the oligomers were labeled using chemiluminescence. 100 pmoles
of oligomers was added to 16 ul cacodylate buffer, 10 ul fluorescein-dUTP, 16
ul terminal transferase and water to total 160 ul. The mixture was incubated for
1 hour at 37°C.

Membranes were pre-hybridized for 1 hour in [5X SSC/0.02% SDS and 0.5% (w/v)
blocking agent (milk) (Amersham, UK)]. The probe was added and hybridization
was allowed to proceed for 1 hour in a shaking bath at the temperature which is
10°C below the melting temperature of the probe. The membranes were washed
twice in 3X SSC/0.1% SDS at room temperature for 5 min. The membranes were
then washed three times in 3X SSC/0.1% SDS at the melting temperature of the
probe for 15 min. in the washing bath. The membranes were then rinsed (NaCl,
0.15 mol/lit; Tris buffer, 0.1 mol/lit; pH 7.5) for 1 minute followed by incubation
in block buffer (5% dry nonfat milk/TBS) for 30 min. The membranes were then
rinsed in TBS for 1 min. and incubated in antibody solution (1:5000 anti-fluorescein
alkaline phosphatase antibody (Boehringer Mannheim) in 5% milk/TBS) for 30
minutes. The membrane was then washed eight times in TBS for 5 min. with
shaking. The detection reagent, lumigen PPD (Boehringer Mannheim; 1:100 in
MgCl2, 50 mmol/lit/TBS, pH 9.5), was mixed, and the blots were incubated in the
solution for 1 min. The extra solution was removed and the membrane wrapped
with plastic and exposed to X-ray film immediately for 10 to 60 minutes.



EXAMPLE 2

Paraffin embedded primary lung tumor samples from 15 patients who went on
to develop adenocarcinoma of the lung during the JHLP trial were studied (Table
1). Patients were chosen based on the availability of paraffin embedded tissue
following surgical resection and negative sputum cytology prior to definitive
diagnosis. Following extraction of tumor DNA, a segment of two target genes
from these primary carcinoma samples was amplified through PCR in order to
detect mutations in the k-ras oncogene and p53 tumor suppressor gene.

Adenocarcinomas were chosen because these tumors have a higher incidence
of k-ras mutations (30%) than other lung tumors (Rodenhuis, et al., Can Res
48:5738-5741, 1988; Rodenhuis, S & Slebos, RJ, Am Rev Resp Dis 142:S27-S30,
1990). p53 gene mutations were examined as these are the most common
genetic alterations found in these tumors and a variety of other cancers
(Takahashi, et al., Science 246:491-494, 1989; Kishimoto, et al., Can Res
52:4799-4804, 1992; Hollstein, et al., Science 253:49-53, 1991). Sequence
analysis of the PCR products from the two target genes identified 10 primary
tumors which contained either a k-ras or p53 gene mutation (Table 2). The k-ras
mutations were those commonly associated with adenocarcinoma of the lung
and occurred predominantly at codon 12. The two p53 mutations occurred at
codons 273 and 281, and both were previously described in lung cancers
(Takahashi, et al., supra; Kishimoto, et al., supra; Hollstein, et al., supra). Many
tumor samples were small (<1 cm) and DNA was insufficient for extensive
sequence analysis of p53 (exons 5 to 8) in all tumors.

Following identification of tumors with oncogene mutations, all of the available
corresponding sputum samples from these affected patients were obtained.
None of the patients that were analyzed ever had a positive sputum cytology in
the JHLP study. Therefore, all but one of the patients (L4, evaluated because of
cough) were diagnosed by chest x-ray.

Those sputum samples that were available (generally one or two samples prior
to clinical diagnosis) were analyzed by a PCR-based assay able to detect one
mutant-containing cell among an excess background of 10,000 normal cells
(Sidransky, D., et al., Science 252:706-709, 1991; Sidransky, D., et al., Science
256:102-105, 1992). This assay was based on the amplification of sputum DNA,
followed by cloning into a phage vector and transfer to nylon membranes as
described in Example 1. A mutant-specific oligomer was hybridized to each of
the filters to identify specific point mutations in either the k-ras or p53 gene
present in sputum. Neoplastic cells harboring either ras or p53 mutations were
detected in sputum from eight of the 10 patients who had tumors containing
oncogene mutations. Positive sputum samples harboring a clonal population of
cancer cells were obtained from one to 13 months prior to clinical diagnosis






(Table 2). One cancer cell among 160 normal cells (1/320 x 2 = 1/160;
each normal cell contributes 2 wildtype alleles) was detected in sputum from
Patient L4 thirteen months prior to his clinical diagnosis (Table 2). The tumor
from this patient contained >95% positive plaques when probed with a mutant-
specific codon 273 oligomer for p53 and his sputum sample also contained
several positive plaques (FIGURE 1). Many clones with a PCR insert from the p53
gene hybridized to an oligomer probe specific for the codon 273 Arg-His
mutation in the patient's (L4) tumor (T). A smaller number of clones hybridized to
the same specific probe in the patient's sputum (S). There were no hybridizing
clones to this probe in a control sputum from a patient (L15) without a p53 gene
mutation in his primary lung cancer (C). Stored sputum samples in 30cc screw
top glass bottles containing 2% carbowax/50% alcohol (Saccomanno's fixative)
preservative solution were located [details on collection of samples in (Berlin, NI,
Buncher, CR, Fontana, RS, Frost, JK & Melamed, MR, Am Rev Resp Dis
130:545-549, 1984; Tockman, MS, Chest 89:324S-325S, 1986; Tockman, MS,
et al., J Clin Oncol 6:1685-1693, 1988)]. 5cc was removed from each (approx
50,000-500,000 cells), spun at 1,000 x g for 5 mins, and rehydrated in 5cc of
normal saline. Cells were then respun at 1,000 x g and resuspended in 1 ml of 1%
SDS/proteinase k (5mg//il was used for each PCR reaction performed in a
separate PCR dedicated room to eliminate possibility of contamination. Primers
from k-ras and p53 contained EcoRI sites to facilitate cloning. Following 35
cycles of amplification, products were cleaved with EcoRI and ligated to Lambda
Zap II (Stratagene) (Sidransky, D., et al., Science 256:102-105, 1992). XL1 cells
infected with bacteriophage were plated on L-Agar at a density of 500-3,000
plaques per plate, transferred to nylon membranes and hybridized with
oligonucleotides. Oligonucleotides used for hybridizations were labelled with
32P and hybridized as in (Sidransky, D., et al., Science 252:706-709, 1991;
Sidransky, et al., 1992, supra). Oligonucleotides used for detection included:
5'-GGAGCTGGTGGCGTAGGCAA-3' for WT ras (SEQ. ID NO. 119),
5'-GGAGCTGTTGGCGTAGGCAA-3' for the Val12 mutant (SEQ. ID NO. 115),
5'-GGAGCTGATGGCGTAGGCAA-3' for the Asp12 mutant (SEQ. ID NO. 116),
5'-GGAGCTAGTGGCGTAGGCAA-3' for the Ser12 mutant (SEQ. ID NO. 117),
5'-GGAGCTTGTGGCGTAGGCAA-3' for the Cys12 mutant (SEQ. ID NO. 118),
5'-ATGGGCGCCATGAACCGG-3' for WT p53 (SEQ. ID NO. 120),
5'-TTGAGGTGCATGTTTGTG-3' for the His273 mutant (SEQ. ID NO. 121), and
5'-TGGGAGAGGCCGGCGCA-3' for the Gly281 mutant (SEQ. ID NO. 122).
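Each ras mutant probe listed above differs from the wild-type probe at a single base. The sketch below makes that explicit by comparing each mutant probe with the wild-type sequence and reading off the affected codon. The codon offset within the probe (0-based index 6) and the small codon-to-amino-acid table are assumptions inferred from the listed sequences and the standard genetic code, not values stated in the text.

```python
WT_PROBE = "GGAGCTGGTGGCGTAGGCAA"  # SEQ. ID NO. 119 (wild-type ras)

MUTANT_PROBES = {
    115: "GGAGCTGTTGGCGTAGGCAA",
    116: "GGAGCTGATGGCGTAGGCAA",
    117: "GGAGCTAGTGGCGTAGGCAA",
    118: "GGAGCTTGTGGCGTAGGCAA",
}

# Assumed position of codon 12 within the 20-mer probe (0-based slice).
CODON12 = slice(6, 9)

# Only the codons that occur in these probes (standard genetic code).
AMINO_ACID = {"GGT": "Gly", "GTT": "Val", "GAT": "Asp", "AGT": "Ser", "TGT": "Cys"}


def mismatch_positions(probe, reference=WT_PROBE):
    """Return the 0-based positions where probe differs from reference."""
    if len(probe) != len(reference):
        raise ValueError("probes must have equal length")
    return [i for i, (a, b) in enumerate(zip(probe, reference)) if a != b]


def codon12_call(probe):
    """Return the amino acid encoded at the assumed codon 12 position."""
    return AMINO_ACID[probe[CODON12]]


if __name__ == "__main__":
    for seq_id, probe in MUTANT_PROBES.items():
        print(seq_id, mismatch_positions(probe), probe[CODON12], codon12_call(probe))
```

A single mismatch in a 20-mer is what makes the stringent wash at the probe's melting temperature necessary: the wild-type sequence would otherwise remain hybridized to the mutant-specific probe.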

To verify identification of the codon 273 mutation in the sputum, a positive plaque
from the sputum assay was picked and sequenced, confirming presence of the
same mutation (FIGURE 2). Sequencing gels of pooled clones for p53 show the
WT sequence in a normal (N) control and the 273 CGT-CAT (Arg-His) mutant
sequence (arrow) in the tumor (T) with a residual WT band. One positive plaque
depicted in FIGURE 1(S) was picked and sequenced, revealing the same mutant
band in sputum (S) as that found in the primary tumor. In each panel, the lanes
represent A, C, G, and T terminations from left to right. Briefly, tumor DNA was
extracted from paraffin slides by incubation in xylene, followed by treatment with
SDS/proteinase k and ethanol precipitation. Exon 1 of the K-ras gene and exons
5 to 8 of p53 (if sufficient DNA available) were amplified by PCR, cloned into
Lambda Zap II (Stratagene) and sequenced as described (Sidransky, et al., 1991,
supra). However, the p53 gene was amplified in two segments utilizing the
following primers: p53-5S 5'-GTAGGAATTCACTTGTGCCCTGACTT-3' (SEQ.
ID NO. 3) and p53-6AS 5'-CATCGAATTCCACTGACAACCACCCTT-3' (SEQ. ID
NO. 4) (exons 5-6); p53-7S 5'-GTAGGAATTCCAAGGCGCACTGGCCTC-3'
(SEQ. ID NO. 5) and p53-8AS 5'-ACTGAATTCTTCGTCTCCTCCACCGC-3'
(SEQ. ID NO. 6) for exons 7-8.

In positive sputum samples, the ratio of cancer to normal cells ranged from one
in 600 (Patient L5) to 1 in 50 (Patient L3), with the majority of samples containing
approximately one cancer cell among 150 normal cells. Five control patients with
lung cancer whose tumors did not contain either a k-ras (codons 12 and 13) or
p53 (exons 5 to 8) gene mutation by sequence analysis were negative by this
assay for all twelve k-ras and both p53 mutant-specific probes.
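The cell ratios quoted here follow from plaque counts. The Patient L4 figure given earlier (one mutant plaque in 320, with each normal cell contributing 2 wildtype alleles) corresponds to one cancer cell among 160 normal cells. A minimal sketch of that conversion follows. The function name is illustrative, and, like the text, the calculation ignores any wildtype allele contributed by the cancer cell itself.

```python
from fractions import Fraction


def normal_cells_per_cancer_cell(mutant_plaques, total_plaques,
                                 alleles_per_normal_cell=2):
    """Convert plaque counts to an approximate normal:cancer cell ratio.

    Each plaque carries one amplified allele, and each normal cell is
    taken to contribute `alleles_per_normal_cell` wildtype alleles.
    """
    if mutant_plaques <= 0 or total_plaques <= 0:
        raise ValueError("plaque counts must be positive")
    plaques_per_mutant = Fraction(total_plaques, mutant_plaques)
    return plaques_per_mutant / alleles_per_normal_cell


if __name__ == "__main__":
    # Patient L4: 1 mutant plaque among 320 plaques.
    print(normal_cells_per_cancer_cell(1, 320))
```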

In two patients, both of whom had only a single sputum sample available, a clonal
population of mutant-containing cells was undetectable despite the presence of
mutations in their primary tumors (Table 2). Furthermore, the first emergence of
a clonal population of cancer cells in positive patients was untraceable by this
sputum assay because of missing samples prior to diagnosis. However, in six
of the 10 patients, sputum samples were available >24 months prior to diagnosis
and were negative. This suggests a limit of 13 to 24 months for molecular
diagnosis by our assay prior to clinical presentation. As a further control, 6/8
patients who initially tested positive were found to test negative in sputum
samples obtained following complete surgical resection of their tumor. There did
not appear to be any correlation between the site of the tumor and the ability to
detect mutations in sputum samples by this assay. "False negative" samples
may be due to inappropriate collection, poor cellular content or to the variable
presence of tumor cells in sputum. Information gained from cytological diagnosis
in sputum through routine light microscopy has previously suggested that
examination of two or three samples may improve the diagnostic yield
significantly, and this likely pertains to molecular diagnosis as well.

Patient L4 with a large T3 lesion had the longest interval (13 months) from
sputum detection to diagnosis. He also had a negative transthoracic biopsy six
months prior to definite diagnosis because a small lesion was noted on chest
x-ray. This patient thus had two opportunities for routine cytopathologic detection
prior to clinical diagnosis. He died of metastatic disease four months following 
surgical resection. This particular case serves to illustrate the limitations of 
routine cytology and the ability to augment morphologic analysis through the use 
of probes for specific gene mutations. 

TABLE 2

LUNG CANCER PATIENTS WITH GENE MUTATION ANALYSIS OF SPUTUM

                                                       MUTATIONS FOUND IN SPUTUM
Patient  Age/  Tumor      Tumor Type/  Tumor         Pre-Dx   Ratio    Post-
No.      Sex   Location#  Stage*       Mutations     (Mon.)   (M/T)    Op

L1       65/M  RUL        A/T1,N0,M0   K-ras-12Ser   3        1/270    Neg.
L2       57/M  LUL        A/T2,N1,M0   K-ras-12Asp   1        1/300    N/A
L3       63/M  RUL        A/T1,N0,M0   K-ras-12Val   4        1/100    Neg.
L4       51/M  LUL        A/T3,N0,M0   p53-273His    13       1/320    N/A
L5       67/M  LUL        A/T1,N0,M0   K-ras-12Cys   1        1/1200   Neg.
L6       67/M  RUL        A/T3,N0,M0   p53-281Gly    3        Neg.     Neg.
L7       7Q/M  RUL        A/T1,N0,M0   K-ras-12Cys   1        1/220    Neg.
L8       59/M  RUL        A/T1,N0,M0   K-ras-12Cys   1        1/170    Neg.
L9       48/M  RUL        A/T1,N0,M0   K-ras-12Val   0        Neg.     Neg.
L10      63/M  RLL        A/T1,N0,M0   K-ras-12Cys   1        1/330    Neg.
L11      60/M  RUL        A/T3,N1,M0   None          0        Neg.     ND
L12      56/M             A/T1,N0,M0   None          0        Neg.     ND
L13      65/M  RUL        A/T1,N0,M0   None          0        Neg.     ND
L14      61/F  RUL        A/T2,N0,M0   None          0        Neg.     ND
L15      62/M  LUL        A/T2,N0,M0   None          0        Neg.     ND

A: adenocarcinoma; L: large cell carcinoma
Pre-Dx: months prior to actual clinical diagnosis
Op: operation; N/A: not available; ND: not done
M/T: number of mutant alleles/total plaques with target gene insert
#RUL: right upper lobe; LUL: left upper lobe; RLL: right lower lobe;
LLL: left lower lobe, of the lung.
*Staging system is according to the AJCC (Mountain, C.F., Chest, 89:225S-233S,
1986). All sputum samples for Patients L1 to L10 were reamplified, recloned,
and the assay repeated for verification.
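
The K-ras tumor mutations named in Table 2 (12Val, 12Asp, 12Ser, 12Cys) can be related to the K-ras probes in the Sequence Listing (SEQ ID NO:7 to 10 and the wild-type probe, SEQ ID NO:13). The sketch below is an illustration only. It assumes that the listed probes are antisense and that codon 12 occupies bases 7 to 9 of the complementary strand; both assumptions are consistent with the clone names but are not stated explicitly in the text. All identifiers are hypothetical.

```python
# Probe sequences from the Sequence Listing, spaces removed.
PROBES = {
    "K-ras WT": "TTGCCTACGCCACCAGCTCC",     # SEQ ID NO:13
    "K-ras Val12": "TTGCCTACGCCAACAGCTCC",  # SEQ ID NO:7
    "K-ras Asp12": "TTGCCTACGCCATCAGCTCC",  # SEQ ID NO:8
    "K-ras Ser12": "TTGCCTACGCCACTAGCTCC",  # SEQ ID NO:9
    "K-ras Cys12": "TTGCCTACGCCACAAGCTCC",  # SEQ ID NO:10
}

# Only the codons needed here, from the standard genetic code.
CODONS = {"GGT": "Gly", "GTT": "Val", "GAT": "Asp", "AGT": "Ser", "TGT": "Cys"}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def codon12(probe: str) -> str:
    """Codon read from the complementary strand at bases 7-9 (assumed frame)."""
    return reverse_complement(probe)[6:9]


def residue12(probe: str) -> str:
    """Amino acid encoded by the assumed codon 12 of the given probe."""
    return CODONS[codon12(probe)]


if __name__ == "__main__":
    for name, probe in PROBES.items():
        print(name, codon12(probe), residue12(probe))
```

With this reading, each mutant probe differs from the wild-type probe at a single base, and the encoded residue matches the residue in the clone name.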

The results of these experiments provide an embodiment wherein successful
detection of neoplasia was accomplished and provide a practical basis for a
new approach for detecting the presence of neoplasias, such as lung tumors, in
a non-invasive fashion. The approach would have utility in monitoring patient
populations and treatments designed to minimize the incidence of neoplasia. It
also could be used in screening asymptomatic patients, especially those at risk
by virtue of inherited or environmental factors, for the presence of neoplasia.
The current results indicate that a significant fraction of early lung cancers and
dangerous pre-malignant lesions can be identified through this strategy.
Additionally, these findings indicate that other mutant nucleotide sequences,
besides K-ras and p53, which are associated with or indicative of lung neoplasias,
would also be detectable in sputum specimens.

SEQUENCE LISTING



(1) GENERAL INFORMATION:
(i) APPLICANT: The Johns Hopkins University School of Medicine
(ii) TITLE OF INVENTION: NUCLEIC ACID MUTATION DETECTION BY
ANALYSIS OF SPUTUM
(iii) NUMBER OF SEQUENCES: 128
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Spensley Horn Jubas & Lubitz
(B) STREET: 1880 Century Park East, Suite 500
(C) CITY: Los Angeles
(D) STATE: California
(E) COUNTRY: USA
(F) ZIP: 90067
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: PCT
(B) FILING DATE: 10-NOV-1994
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Lisa A. Haile
(B) REGISTRATION NUMBER: P.38,347
(C) REFERENCE/DOCKET NUMBER: FD-2912
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (619) 455-5100
(B) TELEFAX: (619) 455-5110



(2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras primer
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..28
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
AGGAATTCAT GACTGAATAT AAACTTGT 28

(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras primer
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..30
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
ATCGAATTCT ATGCATATTA AAACAAGATT

(2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: p53-5S
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..26
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
GTAGGAATTC ACTTGTGCCC TGACTT

(2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: p53-6AS
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..27
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
CATCGAATTC CACTGACAAC CACCCTT

(2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: p53-7S
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..27
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
GTAGGAATTC CAAGGCGCAC TGGCCTC

(2) INFORMATION FOR SEQ ID NO:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: p53-8AS
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..26
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
ACTGAATTCT TCGTCTCCTC CACCGC

(2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras Val12
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
TTGCCTACGC CAACAGCTCC

(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras Asp12
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
TTGCCTACGC CATCAGCTCC

(2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras Ser12
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
TTGCCTACGC CACTAGCTCC

(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras Cys12
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
TTGCCTACGC CACAAGCTCC

(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: p53 His273
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
CACAAACATG CACCTCAA

(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: p53 Gly281
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
TGCGCCGGCC TCTCCCA

(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras WT
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
TTGCCTACGC CACCAGCTCC

(2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: p53 WT
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
CCGGTTCATG GCGCCCAT

(2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
CACCAGCCCC TCCTGG

(2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
TCTGGGCTGC TTGCATTC

(2) INFORMATION FOR SEQ ID NO:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
GCCAACTGCC CAAGACC

(2) INFORMATION FOR SEQ ID NO:18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
GCCCTGTGTA GCTGTGG

(2) INFORMATION FOR SEQ ID NO:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:
GCAGCTGTGA GTTGATTC

(2) INFORMATION FOR SEQ ID NO:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:
TTCCACACAC CCGCCCG

(2) INFORMATION FOR SEQ ID NO:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:
CCCGCCCGTC ACCCGC

(2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:
GCACCCGCTT CCGCGC

(2) INFORMATION FOR SEQ ID NO:23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
CCGCGTCCTC GCCATG

(2) INFORMATION FOR SEQ ID NO:24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:
CGTCCGCGTC ATGGCC

(2) INFORMATION FOR SEQ ID NO:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:
CGCCATGGAC ATCTACA

(2) INFORMATION FOR SEQ ID NO:26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
CGCCATGACC ATCTACA

(2) INFORMATION FOR SEQ ID NO:27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
ACATGACGTA GGTTGTGA

(2) INFORMATION FOR SEQ ID NO:28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:
CGGAGGTTTT GAGGCGC

(2) INFORMATION FOR SEQ ID NO:29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:
TTGTGAGGCA CTGCCCC

(2) INFORMATION FOR SEQ ID NO:30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:
TTGTGAGGCT CTGCCCC

(2) INFORMATION FOR SEQ ID NO:31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:
AGGCGCTGGC CCCACC

(2) INFORMATION FOR SEQ ID NO:32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:
AGGCGCTACC CCCACC




(2) INFORMATION FOR SEQ ID NO:33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:
AGGCGCTTCC CCCACC

(2) INFORMATION FOR SEQ ID NO:34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:
CCCCCACTAT GAGCG' 

(2) INFORMATION FOR SEQ ID NO:35:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:
CCCCACCAGG AGCGCT

(2) INFORMATION FOR SEQ ID NO:36:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:
CCCCACCGTG AGCGCT

(2) INFORMATION FOR SEQ ID NO:37:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:
CCCCCACGAT GAGCGC 16

(2) INFORMATION FOR SEQ ID NO:38:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:
TCAGCATCGT ATCCGAG 17

(2) INFORMATION FOR SEQ ID NO:39:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:
ATCCGAGTGT AAGGAAATT 19



(2) INFORMATION FOR SEQ ID NO:40:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:
CGAGTGGAAG AAAATTTGC 19






(2) INFORMATION FOR SEQ ID NO:41:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:
TGTGGAGTGT TTGGATGA

(2) INFORMATION FOR SEQ ID NO:42:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:
TGGATGACTG AAACACTTT

(2) INFORMATION FOR SEQ ID NO:43:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:
ACACTTTTTG ACATAGTGT

(2) INFORMATION FOR SEQ ID NO:44:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:
ACACTTTTCC ACATAGTGT

(2) INFORMATION FOR SEQ ID NO:45:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:
CACTTTTCGA CGTAGTGTG

(2) INFORMATION FOR SEQ ID NO:46:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:
TTTTCGACAT ATTGTGGTG

(2) INFORMATION FOR SEQ ID NO:47:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:
ATAGTGTGTT GGTGCCCT

(2) INFORMATION FOR SEQ ID NO:48:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:
GGTGCCCTGT GAGCCG

(2) INFORMATION FOR SEQ ID NO:49:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:
CCGCCTGACG TCTGGTT

(2) INFORMATION FOR SEQ ID NO:50:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:
TCTGACTGAA CCACCATC

(2) INFORMATION FOR SEQ ID NO:51:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:
CATCCACTGC AACTACAT



(2) INFORMATION FOR SEQ ID NO:52:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:
CAACTACATA TGTAACAGTT

(2) INFORMATION FOR SEQ ID NO:53:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:
ACTACATGTT TAACAGTTCC



(2) INFORMATION FOR SEQ ID NO:54:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:
ACTACATGTG AAACAGTTCC

(2) INFORMATION FOR SEQ ID NO:55:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:
CAGTTCCTCC ATGGGCG

(2) INFORMATION FOR SEQ ID NO:56:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:
CAGTTCCTTC ATGGGCG

(2) INFORMATION FOR SEQ ID NO:57:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:
CAGTTCCTGG ATGGGCG

(2) INFORMATION FOR SEQ ID NO:58:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:
CCTGCATGTG CGGCATG

(2) INFORMATION FOR SEQ ID NO:59:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:
GCATGGGCTG CATGAAC

(2) INFORMATION FOR SEQ ID NO:60:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:
GCATGGGCGA CATGAAC

(2) INFORMATION FOR SEQ ID NO:61:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:
GGCGGCATCA ACCGGAG

(2) INFORMATION FOR SEQ ID NO:62:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:
GGCATGATCC GGAGGCC



(2) INFORMATION FOR SEQ ID NO:63:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:
CATGAACCTG AGGCCCAT 18

(2) INFORMATION FOR SEQ ID NO:64:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:
GCATGAACTG GAGGCCCA 18

(2) INFORMATION FOR SEQ ID NO:65:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:
GCATGAACCA GAGGCCCA

(2) INFORMATION FOR SEQ ID NO:66:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:
AACCGGAGTC CCATCCTC

(2) INFORMATION FOR SEQ ID NO:67:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:
AACCGGAGCC CCATCCT

(2) INFORMATION FOR SEQ ID NO:68:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:
AACCGGATGC CCATCCTC

(2) INFORMATION FOR SEQ ID NO:69:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69:
GAACCGGGGG CCCATC

(2) INFORMATION FOR SEQ ID NO:70:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:
TCACACTGTA AGACTCCA

(2) INFORMATION FOR SEQ ID NO:71:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:
TCACACTGAA AGACTCCA

(2) INFORMATION FOR SEQ ID NO:72:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:
CACTGGAAGT CTCCAGGT

(2) INFORMATION FOR SEQ ID NO:73:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:
TAATCTACCG GGACGGAA

(2) INFORMATION FOR SEQ ID NO:74: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:
TAATCTACCT GGACGGAA 
(2) INFORMATION FOR SEQ ID NO:75:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
TCTACTGGTA CGGAACAG 

(2) INFORMATION FOR SEQ ID NO:76: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 
ACTGGGACCG AACAGCTT 
(2) INFORMATION FOR SEQ ID NO:77:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:
GCTTTGAGCT GCGTGTTT 

(2) INFORMATION FOR SEQ ID NO:78: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:
GCTTTGAGGA GCGTGTTT 
(2) INFORMATION FOR SEQ ID NO:79:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:
TTGAGGTGCT TGTTTGTG 

(2) INFORMATION FOR SEQ ID NO:80: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:
TTGAGGTGTG TGTTTGTG 
(2) INFORMATION FOR SEQ ID NO:81:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:
TTGAGGTGCA TGTTTGTG 

(2) INFORMATION FOR SEQ ID NO:82: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82:
TTGAGGTGCC TGTTTGTG 
(2) INFORMATION FOR SEQ ID NO:83:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83:
AGGTGCGTGG TTGTGCCT 

(2) INFORMATION FOR SEQ ID NO:84: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
GCGTGTTTAT GCCTGCCT 
(2) INFORMATION FOR SEQ ID NO:85:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:
GCGTGTTTTT GCCTGCCT 

(2) INFORMATION FOR SEQ ID NO:86: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86:
TTGTGCCTTT CCTGGGAG 
(2) INFORMATION FOR SEQ ID NO:87:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87: 
TGCCTGTCTT GGGAGAGA 

(2) INFORMATION FOR SEQ ID NO:88: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
TGCCTGTTCT GGGAGAGA 
(2) INFORMATION FOR SEQ ID NO:89:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89:
TGCCTGTCGT GGGAGAGA 

(2) INFORMATION FOR SEQ ID NO:90: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90:
TCCTGGGATA GACCGGCG 
(2) INFORMATION FOR SEQ ID NO:91:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91:
TGGGAGATAC CGGCGCAC 

(2) INFORMATION FOR SEQ ID NO:92: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..17

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92:
GAGAGACCCG CGCACAG 
(2) INFORMATION FOR SEQ ID NO:93:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 
GAGAGACGGG CGCACAG 

(2) INFORMATION FOR SEQ ID NO:94: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 
GAGAGACTGG CGCACAG 
(2) INFORMATION FOR SEQ ID NO:95:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 
AGACCGGCCC ACAGAGG 

(2) INFORMATION FOR SEQ ID NO:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 
AGACCGGGGC ACAGAGG 
(2) INFORMATION FOR SEQ ID NO:97:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 
CCGGCGCCCA GAGGAA 

(2) INFORMATION FOR SEQ ID NO:98: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98:
GCGCACAAAG GAACAGAA
(2) INFORMATION FOR SEQ ID NO:99:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99:
CACAGAGGGA GAGAATCT 

(2) INFORMATION FOR SEQ ID NO:100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 
ATCTCCGCTA GAAAGGGG 
(2) INFORMATION FOR SEQ ID NO:101:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101: 
GCAAGAAAGG GAGCCTC 

(2) INFORMATION FOR SEQ ID NO:102: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102: 
CTCACCACTA GCTGCCC 
(2) INFORMATION FOR SEQ ID NO:103:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 
GATGTTCTGA GAGCTGAA 

(2) INFORMATION FOR SEQ ID NO:104: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104:
GATGTTCCAG AGCTGAAT 
(2) INFORMATION FOR SEQ ID NO:105:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105:
GGCCTTGAAC TCAAGGAT 

(2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 
CTGGGAGACA CCGGCG 






(2) INFORMATION FOR SEQ ID NO:107:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
ACCGGAGGTT CATCCTC 

(2) INFORMATION FOR SEQ ID NO:108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..19

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108:
CATGTGTAAA CAGTTCCTG 
(2) INFORMATION FOR SEQ ID NO:109:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:109:
AGCATCTTAA TCCGAGTG 

(2) INFORMATION FOR SEQ ID NO:110:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..17

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:110: 
GTCTGGCCCT CCTCAGC 
(2) INFORMATION FOR SEQ ID NO:111:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..17

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:111:
GCTGCCCCCC ACCATGA 

(2) INFORMATION FOR SEQ ID NO:112:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..15

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:112:
CCGCGTCGCG CCATG 
(2) INFORMATION FOR SEQ ID NO:113:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..17

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113:
AGACCTGCCT GTGCAGC 

(2) INFORMATION FOR SEQ ID NO:114:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:114:
CCTGTCCTTG GGAGAGA 
(2) INFORMATION FOR SEQ ID NO:115:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: K-ras Val12 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:115:
GGAGCTGTTG GCGTAGGCAA 

(2) INFORMATION FOR SEQ ID NO:116:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras Asp12
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:116:
GGAGCTGATG GCGTAGGCAA 

(2) INFORMATION FOR SEQ ID NO:117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:

(B) CLONE: K-ras Ser12
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:117:
GGAGCTAGTG GCGTAGGCAA 
(2) INFORMATION FOR SEQ ID NO:118:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:

(B) CLONE: K-ras Cys12
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118:
GGAGCTTGTG GCGTAGGCAA 

(2) INFORMATION FOR SEQ ID NO:119:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:

(B) CLONE: WT K-ras
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..20

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:119:
GGAGCTGGTG GCGTAGGCAA 

(2) INFORMATION FOR SEQ ID NO:120:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:

(B) CLONE: WT p53
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:120:
ATGGGCGCCA TGAACCGG

(2) INFORMATION FOR SEQ ID NO:121:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:

(B) CLONE: p53 His273
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..18

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:121:
TTGAGGTGCA TGTTTGTG

(2) INFORMATION FOR SEQ ID NO:122: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: p53 Gly281
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..17

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122:
TGGGAGAGGC CGGCGCA 
(2) INFORMATION FOR SEQ ID NO:123:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras probe
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..28

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123:
ACAAGTTTAT ATTCAGTCAT GAATTCCT 28



(2) INFORMATION FOR SEQ ID NO:124:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: K-ras probe
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 
AATCTTGTTT TAATATGCAT AGAATTCGAT 

(2) INFORMATION FOR SEQ ID NO:125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: p53 probe
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..26

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125:
AAGTCAGGGC ACAAGTGAAT TCCTAC

(2) INFORMATION FOR SEQ ID NO:126: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:

(B) CLONE: p53 probe
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..27

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126:
AAGGGTGGTT GTCAGTGGAA TTCGATG 

(2) INFORMATION FOR SEQ ID NO:127: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:
(B) CLONE: p53 probe
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..27

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127:
GAGGCCAGTG CGCCTTGGAA TTCCTAC 
(2) INFORMATION FOR SEQ ID NO:128:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vii) IMMEDIATE SOURCE:

(B) CLONE: p53 probe
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..26

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128:
GCGGTGGAGG AGACGAAGAA TTCAGT 26
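
The K-ras probes of SEQ ID NOS:115-119 are identical except near codon 12. As an illustrative aid only (it is not part of the original disclosure), the following Python sketch compares each mutant probe with the wild-type probe and reports the 1-based positions at which they differ. The function name `mismatches` and the dictionary layout are assumptions introduced here; the sequences are transcribed from the listing above.

```python
# Illustrative sketch; the names below are hypothetical and not taken from the patent.
WT_KRAS = "GGAGCTGGTGGCGTAGGCAA"  # SEQ ID NO:119 (CLONE: WT K-ras)

MUTANT_PROBES = {
    "Val12 (SEQ ID NO:115)": "GGAGCTGTTGGCGTAGGCAA",
    "Asp12 (SEQ ID NO:116)": "GGAGCTGATGGCGTAGGCAA",
    "Ser12 (SEQ ID NO:117)": "GGAGCTAGTGGCGTAGGCAA",
    "Cys12 (SEQ ID NO:118)": "GGAGCTTGTGGCGTAGGCAA",
}


def mismatches(reference, probe):
    """Return (position, reference_base, probe_base) for each differing base.

    Positions are 1-based, matching the LOCATION convention of the listing.
    """
    if len(reference) != len(probe):
        raise ValueError("sequences must have the same length")
    return [
        (pos, ref_base, probe_base)
        for pos, (ref_base, probe_base) in enumerate(zip(reference, probe), start=1)
        if ref_base != probe_base
    ]


if __name__ == "__main__":
    for name, seq in MUTANT_PROBES.items():
        print(name, mismatches(WT_KRAS, seq))
```

Under these assumptions each mutant probe differs from the wild-type probe at a single base, which is consistent with the single-codon clone names given in the listing.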



The foregoing is meant to illustrate, but not to limit, the scope of the invention.
Indeed, those of ordinary skill in the art can readily envision and produce further
embodiments, based on the teachings herein, without undue experimentation.
CLAIMS

1. A method for detecting a mammalian target nucleic acid in a sputum
specimen, which method comprises isolating the nucleic acid present in the
sputum specimen and detecting the presence of the target nucleic acid.

2. The method of claim 1, wherein the target nucleic acid is amplified before
detecting.

3. The method of claim 2, wherein the amplification is by means of
oligonucleotides which hybridize to the flanking regions of the target nucleic
acid.

4. The method of claim 1, wherein the target nucleic acid comprises a
mutation, a restriction fragment length polymorphism, a nucleic acid deletion,
or a nucleic acid substitution.

5. The method of claim 4, wherein the mutation is associated with a neoplasm.

6. The method of claim 5, wherein the neoplasm is of the lung.

7. The method of claim 6, wherein the neoplasm is benign.

8. The method of claim 6, wherein the neoplasm is malignant.

9. The method of claim 1, wherein the target nucleic acid associated with the
neoplasm is selected from the group consisting of an oncogene and a tumor
suppressor gene.

10. The method of claim 9, wherein the oncogene is a member of the ras family.

11. The method of claim 10, wherein the oncogene is K-ras.

12. The method of claim 9, wherein the tumor suppressor gene is p53. 

13. The method of claim 3, wherein the nucleotide sequence of the flanking
region to which the oligonucleotide hybridizes is selected from
5'-ACAAGTTTATATTCAGTCATGAATTCCT-3' (SEQ. ID NO. 123),
5'-AATCTTGTTTTAATATGCATAGAATTCGAT-3' (SEQ. ID NO. 124),
5'-AAGTCAGGGCACAAGTGAATTCCTAC-3' (SEQ. ID NO. 125),
5'-AAGGGTGGTTGTCAGTGGAATTCGATG-3' (SEQ. ID NO. 126),
5'-GAGGCCAGTGCGCCTTGGAATTCCTAC-3' (SEQ. ID NO. 127), or
5'-GCGGTGGAGGAGACGAAGAATTCAGT-3' (SEQ. ID NO. 128) and
sequences complementary thereto.
14. The method of claim 13, wherein the oligonucleotide is
5'-AGGAATTCATGACTGAATATAAACTTGT-3' (SEQ. ID NO. 1),
5'-ATCGAATTCTATGCATATTAAAACAAGATT-3' (SEQ. ID NO. 2),
5'-GTAGGAATTCACTTGTGCCCTGACTT-3' (SEQ. ID NO. 3),
5'-CATCGAATTCCACTGACAACCACCCTT-3' (SEQ. ID NO. 4),
5'-GTAGGAATTCCAAGGCGCACTGGCCTC-3' (SEQ. ID NO. 5), or
5'-ACTGAATTCTTCGTCTCCTCCACCGC-3' (SEQ. ID NO. 6) and
sequences complementary thereto.

15. The method of claim 1, wherein the target nucleic acid is detected using a
nucleotide hybridization probe.

16. The method of claim 15, wherein the target nucleic acid to which the
nucleotide hybridization probe hybridizes is selected from
5'-TTGCCTACGCCAACAGCTCC-3' (SEQ. ID NO. 7),
5'-TTGCCTACGCCATCAGCTCC-3' (SEQ. ID NO. 8),
5'-TTGCCTACGCCACTAGCTCC-3' (SEQ. ID NO. 9),
5'-TTGCCTACGCCACAAGCTCC-3' (SEQ. ID NO. 10),
5'-CACAAACATGCACCTCAA-3' (SEQ. ID NO. 11),
5'-TGCGCCGGCCTCTCCCA-3' (SEQ. ID NO. 12),
5'-CCGGTTCATGGCGCCCAT-3' (SEQ. ID NO. 14) or
5'-TTGCCCACGCCACCAGCTCC-3' (SEQ. ID NO. 13), and
sequences complementary thereto.
17. The method of claim 16, wherein the nucleotide hybridization probe is
selected from
5'-GGAGCTGTTGGCGTAGGCAA-3' (SEQ. ID NO. 115),
5'-GGAGCTGATGGCGTAGGCAA-3' (SEQ. ID NO. 116),
5'-GGAGCTAGTGGCGTAGGCAA-3' (SEQ. ID NO. 117),
5'-GGAGCTTGTGGCGTAGGCAA-3' (SEQ. ID NO. 118),
5'-TTGAGGTGCATGTTTGTG-3' (SEQ. ID NO. 121),
5'-TGGGAGAGGCCGGCGCA-3' (SEQ. ID NO. 122),
5'-ATGGGCGCCATGAACCGG-3' (SEQ. ID NO. 120), or
5'-GGAGCTGGTGGCGTAGGCAA-3' (SEQ. ID NO. 119), and
sequences complementary thereto.

18. A kit useful for the detection of a target nucleic acid from a sputum
specimen, the kit comprising carrier means being compartmentalized to
receive in close confinement therein one or more containers comprising a
container containing a hybridization probe which hybridizes to the target
nucleic acid.

19. The kit of claim 18, further comprising a container containing an
oligonucleotide primer for amplification of the target nucleic acid.

20. The method of claim 18, wherein the probe is detectably labeled.
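
Claims 13-14 and 16-17 pair each oligonucleotide with a sequence to which it hybridizes. As a hedged illustration (not part of the claims), the Python sketch below computes a reverse complement and checks two such pairs. The helper name `reverse_complement` is an assumption introduced here; the sequences are transcribed from the claims and the sequence listing.

```python
# Illustrative sketch; the helper name is hypothetical and not taken from the patent.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Flanking region (SEQ ID NO:124) and the primer recited for it (SEQ ID NO:2).
FLANKING_124 = "AATCTTGTTTTAATATGCATAGAATTCGAT"
PRIMER_2 = "ATCGAATTCTATGCATATTAAAACAAGATT"

# K-ras Val12 probe (SEQ ID NO:115) and its recited target (SEQ ID NO:7).
PROBE_115 = "GGAGCTGTTGGCGTAGGCAA"
TARGET_7 = "TTGCCTACGCCAACAGCTCC"

if __name__ == "__main__":
    print(reverse_complement(FLANKING_124) == PRIMER_2)
    print(reverse_complement(PROBE_115) == TARGET_7)
```

In both pairs checked here, one sequence is the exact reverse complement of the other, which is what hybridization between the two strands requires.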